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Abstract 



(N 

Program specialization is a program transformation methodology which improves program efficiency 
by exploiting the information about the input data which are available at compile time. We show 
that current techniques for program specialization based on partial evaluation do not perform well 
on nondctcrministic logic programs. We then consider a set of transformation rules which extend 
■ the ones used for partial evaluation, and we propose a strategy for guiding the application of 

these extended rules so to derive very efficient specialized programs. The efficiency improvements 
which sometimes are exponential, are due to the reduction of nondeterminism and to the fact 
that the computations which are performed by the initial programs in different branches of the 
computation trees, are performed by the specialized programs within single branches. In order to 
(*C) , reduce nondeterminism we also make use of mode information for guiding the unfolding process. 

To exemplify our technique, we show that we can automatically derive very efficient matching 
programs and parsers for regular languages. The derivations we have performed could not have 
O ■ been done by previously known partial evaluation techniques. 

•rH . 

X ' 1 Introduction 



The goal of program specialization [2^ is the adaptation of a generic program to a specific context of 
use. Partial evaluation [7J is a well established technique for program specialization which from 
a program and its static input (that is, the portion of the input which is known at compile time), 
allows us to derive a new, more efficient program in which the portion of the output which depends 
on the static input, has already been computed. Partial evaluation has been applied in several areas 
of computer science, and it has been applied also to logic programs ^3 [2H1 1239) where it is also 
called partial deduction. In this paper we follow a rule-based approach to the specialization of logic 
programs @J EH1 EH 1^ • In particular, we consider definite logic programs [2H1 and we propose new 
program specialization techniques based on unfold/fold transformation rules In our approach, 

the process of program specialization can be viewed as the construction of a sequence, say Pq, . . . , P n , 
of programs, where Pq is the program to be specialized, P n is the derived, specialized program, and 
every program of the sequence is obtained from the previous one by applying a transformation rule. 

*A preliminary version of this paper appears as: Reducing Nondeterminism while Specializing Logic Programs. 
Proceedings of the 24th Annual ACM Symposium on Principles of Programming Languages, Paris, France, January 
15-17, 1997, ACM Press, 1997, pp. 414-427. 
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As shown in |36[ I41j. partial deduction can be viewed as a particular rule-based program trans- 
formation technique using the definition, unfolding, and folding rules [IB] with the following two 
restrictions: (i) each new predicate introduced by the definition rule is defined by precisely one non- 
recursive clause whose body consists of precisely one atom (in this sense, according to the terminology 
of ^>1> partial deduction is said to be monogenetic), and (ii) the folding rule uses only clauses in- 
troduced by the definition rule. In what follows the definition and folding rules which comply with 
restrictions (i) and (ii), are called atomic definition and atomic folding, respectively. 

In Section 01 we will see that the use of these restricted transformation rules makes it easier 
to automate the partial deduction process, but it may limit the program improvements which can 
be achieved during program specialization. In particular, when we perform partial deduction of 
nondeterministic programs using atomic definition, unfolding, and atomic folding, it is impossible to 
combine information present in different branches of the computation trees, and as a consequence, it 
is often the case that we cannot reduce the nondeterminism of the programs. 

This weakness of partial deduction is demonstrated in Section 13.31 where we revisit the familiar 
problem of looking for occurrences of a pattern in a string. It has been shown in 1131 115j that by 
partial deduction of a string matching program, we may derive a deterministic finite automaton (DFA, 
for short), similarly to what is done by the Knuth-Morris-Pratt algorithm [22] • However, in |11 | I13 [ IT5] 
the string matching program to which partial deduction is applied, is deterministic. We show that by 
applying partial deduction to a nondeterministic version of the matching program, one cannot derive 
a specialized program which is deterministic, and thus, one cannot get a program which corresponds 
to a DFA. 

Conjunctive partial deduction 8 is a program specialization technique which extends partial de- 
duction by allowing the specialization of logic programs w.r.t. conjunctions of atoms, instead of a single 
atom. Conjunctive partial deduction can be realized by the definition, unfolding, and folding rules 
where each new predicate introduced by the definition rule is defined by precisely one non-recursive 
clause whose body is a conjunction of atoms (in this sense conjunctive partial deduction is said to be 
poly genetic). 

Conjunctive partial deduction may sometimes reduce nondeterminism. In particular, it may trans- 
form generate-and-test programs into programs where the generation phase and the test phase are 
interleaved. However, as shown in Section [3.31 conjunctive partial deduction is not capable to derive 
from the nondeterministic version of the matching program a new program which corresponds to a 
DFA. 

In our paper, we propose a specialization technique which enhances both partial deduction and 
conjunctive partial deduction by making use of more powerful transformation rules. In particular, in 
Section [1] we consider a version of the definition introduction rule so that a new predicate may be 
introduced by means of several non-recursive clauses whose bodies consist of conjunctions of atoms, 
and we allow folding steps which use these predicate definitions consisting of several clauses. We also 
consider the following extra rules: head generalization, case split, equation elimination, and disequation 
replacement. These rules may introduce, replace, and eliminate equations and negated equations 
between terms. 

Similarly to |14| I4(il I4()j . our extended set of program transformation rules preserves the least 
Herbrand model semantics. For the logic language with equations and negated equations considered in 
this paper, we adopt the usual Prolog operational semantics with the left-to-right selection rule, where 
equations are evaluated by using unification. Unfortunately, the unrestricted use of the extended set 
of transformation rules may not preserve the Prolog operational semantics. To overcome this problem, 
we consider: (i) the class of safe programs and (ii) suitably restricted transformation rules, called safe 
transformation rules. Through some examples we show that the class of safe programs and the safe 
transformation rules are general enough to allow significant program specializations. 

Our notions of safe programs and transformation rules, and also the notion of determinism are 
based on the modes which are associated with predicate calls [321 HHj ■ We describe these notions in 
Section [SJ where we also prove that the application of safe transformation rules preserve the operational 
semantics of safe programs. 

Then, in Section ® we introduce a strategy, called Determinization, for applying our safe trans- 
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formation rules in an automatic way, so to specialize programs and reduce their nondeterminism. 
The new features of our strategy w.r.t. other specialization techniques are: (i) the use of mode in- 
formation for unfolding and producing deterministic programs, (ii) the use of the case split rule for 
deriving mutually exclusive clauses (e.g. from the clause H <— Body we may derive the two clauses: 
(H <— Body){X/t} and H <— X ^ t, Body), and (hi) the use of the enhanced definition and folding 
rules for replacing many clauses by one clause only, thereby reducing nondeterminism. 

Finally, in Section [JJ we show by means of some examples which refer to parsing and matching 
problems, that our strategy is more powerful than both partial deduction and conjunctive partial 
deduction. In particular, given a nondeterministic version of the matching program, one can derive 
by using our strategy a specialized program which corresponds to a DFA. 

2 Logic Programs with Equations and Disequations between Terms 

In this section we introduce an extension of definite logic programs with equations and negated equa- 
tions between terms. Negated equations will also be called disequations. The introduction of equations 
and disequations during program specialization allows us to derive mutually exclusive clauses. The 
declarative semantics we consider, is a straightforward extension of the usual least Herbrand model 
of definite logic programs. The operational semantics essentially is SLD-resolution as implemented 
by most Prolog systems: atoms are selected from left to right, and equations are evaluated by us- 
ing unification. This operational semantics is sound w.r.t. the declarative semantics (see Theorem |2 
below). However, since non-ground disequations can be selected, a goal evaluated according to our 
operational semantics can fail, even if it is true according to the declarative semantics. In this sense, 
the operational semantics is not complete w.r.t. the declarative semantics. 

For the notions of substitution, composition of substitutions, identity substitution, domain of a 
substitution, restriction of a substitution, instance, most general unifier (abbreviated as mgu), ground 
expression, ground substitution, renaming substitution, variant, and for other notions not defined here, 
we refer to [25] . 

2.1 Syntax 

The syntax of our language is defined starting from the following infinite and pairwise disjoint sets: 

(i) variables: X, Y, Z, X\,X2, ■ ■ ■ , 

(ii) function symbols (with arity): /, fi, fa, ■ ■ ■ , and 

(hi) predicate symbols (with arity): true, =, 7^, p,p\,p%, ■ ■ ■ The predicate symbols true, =, and 7^ are 
said to be basic, and the other predicate symbols are said to be non-basic. Predicate symbols will also 
be called predicates, for short. 

Now we introduce the following sets: (iv) Terms: t,t\,t2,..., (v) Basic atoms: B, B\, B2, ■ ■ ■ , 
(vi) Non-basic atoms: A, A\,A2, ■ ■ ■ , and (vii) Goals: G, G\, G2, ■ ■ ■ Their syntax is as follows: 



Basic and non-basic atoms are collectively called atoms. Goals made out of basic atoms only are said 
to be basic goals. Goals with at least one non-basic atom are said to be non-basic goals. The binary 
operator ',' denotes conjunction and it is assumed to be associative with neutral element true. Thus, 
a goal G is the same as goal {true, G), and it is also the same as goal (G, true). 

Clauses: C, C%, C2, ■ ■ ■ have the following syntax: 

C ;■= A *— G 

Given a clause C of the form: A <— G, the non-basic atom A is called the head of C and it is denoted 
by hd(C), and the goal G is called the body of C and it is denoted by bd(C). A clause A <— G where 
G is a basic goal, is called a unit clause. We write a unit clause of the form: A <— true also as: A <— . 
We say that C is a clause for a predicate p iff C is a clause of the form p{. ..)•*— G. 



Terms : 
Basic Atoms : 
Non-basic Atoms : 
Goals : 



t : 
B 
A 
G 



X I f(h, . . . ,t n ) 
- true I t\ = t2 I t\^t2 
-- p(ti, ... ,t m ) 
--B\A\G 1 ,G 2 
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Programs: P, P\, P%, . . . are sets of clauses. 

In what follows we will feel free to use different meta-variables to denote our syntactic expressions, and 
in particular, we will also denote non-basic atoms by H, H±, . . ., and goals by K, K\, Body, Body 1 , . . . 

Given a program P, we consider the relation 5p over pairs of predicates such that Sp(p,q) holds 
iff there exists in P a clause for p whose body contains an occurrence of q. Let 5p~ be the transitive 
closure of dp. We say that p depends on q in P iff Sp~(p,q) holds. We say that a predicate p depends 
on a clause C in a program P iff either C is a clause for p or C is a clause for a predicate q and p 
depends on q in P. 

Terms, atoms, goals, clauses, and programs are collectively called expressions, ranged over by 
e, ei, &ii ■ ■ ■ By vars{e) we denote the set of variables occurring in an expression e. We say that X is 
a local variable of a goal G in a clause C : H <— G\ , G, G2 iff X € vars(G) — vars \H , G\ , G2 ) • 

The application of a renaming substitution to an expression is also called a renaming of variables. 
A renaming of variables can be applied to a clause whenever needed, because it preserves the least 
Herbrand model semantics which we define below. Given a clause C, a renamed apart clause C' is any 
clause obtained from C by a renaming of variables, so that each variable of C' is a fresh new variable. 
(For a formal definition of this concept, see the definition of standardized apart clause in [Tl 128]) 

For any two unifiable terms t\ and £2, there exists at least one mgu i? which is relevant (that is, each 
variable occurring in 1? also occurs in vars (£i)U vars (£2)) an d idempotent (that is, = 1?) fTf. Without 
loss of generality, we assume that all mgu's considered in this paper are relevant and idempotent. 

2.2 Declarative Semantics 

In this section we extend the definition of least Herbrand model of definite logic programs I2B] to logic 
programs with equations and disequations between terms. We follow the approach usually taken when 
defining the least T>- model of CLP programs (see, for instance, [20 ). According to this approach, we 
consider a class of Herbrand models, called TL-models, where the predicates true, =, and 7^ have a fixed 
interpretation. In particular, the predicate = is interpreted as the identity relation over the Herbrand 
universe and the predicate ^ is interpreted as the complement of the identity relation. Then we define 
the least Herbrand model of a logic program with equations and disequations between terms as the 
least 7^-model of the program. 

The Herbrand base TLB is the set of all ground non-basic atoms. An "H-interpretation is a subset 
of TLB. Given an ^-interpretation / and a ground goal, or ground clause, or program ip, the relation 
I \= if, read as ip is true in I, is inductively defined as follows (as usual, by I ty= ip we indicate that 
I \= ip does not hold): 

(i) I \= true 

(ii) for every ground term t, I \= t = t 

(iii) for every pair of distinct ground terms £1 and £2, / \= £1 7^2 

(iv) for every non-basic ground atom A, I (= A iff A E I 

(v) for every pair of ground goals G\ and G2 , I \= G\ , G2 iff I \= G\ and I \= G2 

(vi) for every ground clause C, I \= C iff either I \= hd(C) or / y= bd(C) 

(vii) for every program P , I \= P iff for every ground instance C of a clause in P, I \= C. 

As a consequence of the above definition, a ground basic goal is true in an 7-^-interpretation iff it is true 
in all "H-interpretations. We say that a ground basic goal holds iff it is true in all "H-interpretations. 

An ^-interpretation / is said to be an TL-model of a program P iff / |= P. Since the model 
intersection property holds for 7^-models, similarly to |20] 128] . we can prove the following important 
result. 

Theorem 1 For any program P there exists an 7^-model of P which is the least (w.r.t. set inclusion) 
7^-model. 

The least Herbrand model of a program P is defined as the least TL-model of P and is denoted by 
M(P). 



4 



2.3 Operational Semantics 



We define the operational semantics of our programs by introducing, for each program P, a relation 
Gi i — >p G 2 , where G\ and G% are goals and $ is a substitution, defined as follows: 



(3) (A,G) (bd(C),G)ti iff (i) A is a non-basic atom, 

(ii) C is a renamed apart clause in P, and 

(iii) A and hd(C) are unifiable via an mgu 

A sequence Go i — >p ■ ■ ■ ^-^p G n , with n>0, is called a derivation using P. If G n is £rae then the 
derivation is said to be successful. If there exists a successful derivation Go i— ^>p ■ ■ ■ h^->p true and 
i? is the substitution obtained by restricting the composition i?i ...!?„ to the variables of Go, then we 
say that the goal Go succeeds in P with answer substitution 

When denoting derivations, we will feel free to omit their associated substitutions. In particular, 
given two goals G\ and G2, we write G\ 1 — >p G2 iff there exists a substitution d such that G\ * — >p G 2 . 
We say that G2 is derived in one step from G\ {using P) iff G\ 1 — >p G2 holds. In particular, if G2 is 
derived in one step from G\ according to Point (3) of the operational semantics by using a clause G, 
then we say that G2 is derived in one step from G\ using C. The relation 1 — >* p is the reflexive and 
transitive closure of 1 — >p. Given two goals G\ and G2 such that G\ 1 — >* P G2 holds, we say that G2 is 
derived from G\ (using P). We will feel free to omit the reference to program P when it is understood 
from the context. 

The operational semantics presented above can be viewed as an abstraction of the usual Prolog 
semantics, because: (i) given a goal Gi, in order to derive a goal G2 such that G\ 1 — >p G2, we 
consider the leftmost atom in G±, (ii) the predicate = is interpreted as unifiability of terms, and 
(iii) the predicate 7^ is interpreted as non- unifiability of terms. Similarly to [2B]> we have the following 
relationship between the declarative and the operational semantics. 

Theorem 2 For any program P and ground goal G, if G succeeds in P then M{P) \= G. 

The converse of Theorem El does not hold. Indeed, consider the program P consisting of the clause 
p(l) <— only. We have that M(P) \= p(l) because there exists a value for X, namely 1, which 

is syntactically different from 0. However, p{\) does not succeed in P, because X and are unifiable 
terms. 

2.4 Deterministic Programs 

Various notions of determinism have been proposed for logic programs in the literature (see, for 
instance, ^3 El ISIS])- They capture various properties such as: "the program succeeds at most 
once" , or "the program succeeds exactly once" , or "the program will never backtrack to find alternative 
solutions" . 

Let us now present the definition of deterministic program used in this paper. This definition is 
based on the operational semantics described in Section f2.3l 

We first need the following notation. Given a program P, a clause G G P, and two goals (Aq, Go) 
and (A n , G n ), where Aq is a non-basic atom, we write (Aq, Go) =4>c (^n, G n ) iff there exists a derivation 
(Aq, Go) 1 — >p ■ ■ ■ 1 — >p (A n , G n ), such that: (i) n>0, (ii) (A±, Gi) is derived in one step from (Aq, Go) 
using C, (iii) for i = l,...,n — 1, Aj is a basic atom, and (iv) either A n is a non-basic atom or 
(A n , G n ) is the basic atom true. We write Go =^p G n iff there exist clauses Gi, . . . , G n in P such that 
Go =>Ci ■ ■ ■ =^c n G n - 

Definition 1 (Determinism) A program P is deterministic for a non-basic atom A iff for each goal 
G such that A =>p G, there exists at most one clause C such that G =^c G' for some goal G' . 



(1) (h=t 2 , G) 

(2) (h + t 2 , G) 




iff 
iff 



ti and t 2 are unifiable via an mgu 1? 

ti and t 2 are not unifiable and e is the identity substitution 
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We say that a program P is nondeterministic for a non-basic atom A iff it is not the case that P is 
deterministic for A, that is, there exists a goal G derivable from A, and there exist at least two goals 
G\ and G2, and two distinct clauses C% and C 2 in P, such that G =>Ci G\ and G =>c 2 ^2- 

According to Definition ^ the following program is deterministic for any atom of the form 
non_zero(Xs, Ys) where Xs is a ground list. 

1. non_zero{[ ],[])<— 

2. nonzero ([0|Xs], Ys) <— non-zero{Xs, Ys) 

3. non_2;ero([A A |Xs], [X| Ys]) <— X^O, non_zero{Xs, Ys) 

Notice that the above definition of a deterministic program for a non-basic atom A allows some search 
during the construction of a derivation starting from A. Indeed, there may be a goal G derived from A 
such that from G we can derive in one step two or more new goals using distinct clauses. However, if 
the program is deterministic for A, after evaluating the basic atoms occurring at leftmost positions in 
these new goals, at most one derivation can be continued and at most one successful derivation can be 
constructed. For instance, from the goal non_zero([0, 0, 1], Ys) we can derive in one step two distinct 
goals: (i) non_zero([0, 1], Ys) (using clause 2), and (ii) 7^ 0, non-Zero([Q, 1], Ys') (using clause 3). 
However, there exists only one clause C (that is, clause 2) such that non-zero([0, 0, 1], Ys) =^c G' for 
some goal G' (that is, non_zero([0, 1], Ys')). 

3 Partial Deduction via Unfold/Fold Transformations 

In this section we recall the rule-based approach to partial deduction. We also point out some limi- 
tations of partial deduction |3fi| 141] and conjunctive partial deduction [Hj. These limitations motivate 
the introduction of the new, enhanced rules and strategies for program specialization presented in 
Sections |U El and El 

3.1 Transformation Rules and Strategies for Partial Deduction 

In the rule-based approach, partial deduction can be viewed as the construction of a sequence Pq, . . . , P n 
of programs, called a transformation sequence, where Po is the initial program to be specialized, P n is 
the final, specialized program, and for k = 0, . . . , n — 1, program Pk+i is derived from program P^ by 
by applying one of the following transformation rules PD1-PD4. 

Rule PD1 (Atomic Definition Introduction) We introduce a clause D, called atomic definition 
clause, of the form 

newp(Xi, . . .,X h ) <- A 

where (i) newp is a non-basic predicate symbol not occurring in Pq, . . . , P/., (ii) A is a non-basic atom 
whose predicate occurs in program Pq, and (iii) {X±, . . . ,Xh} = vars(A). 
Program Pk+i is the program U {D}. 

We denote by Defsk the set of atomic definition clauses which have been introduced by the defi- 
nition introduction rule during the construction of the transformation sequence Pq, . . . , P^. Thus, in 
particular, we have that Defs = 0. 

Rule PD2 (Definition Elimination). Let p be a predicate symbol. By definition elimination w.r.t. 
p we derive the program Pk+i = {C G Pj. \ p depends on C}. 

Rule PD3 (Unfolding). Let C be a renamed apart clause of Pk of the form: H <— G\, A, G2, where A 
is a non-basic atom. Let C\, . . . , C m , with m > 0, be the clauses of Pk such that, for i = 1, . . . , m, A 
is unifiable with the head of C{ via the mgu By unfolding C w.r.t. A, for i = 1, . . . , m, we derive 
the clause A : (H «- G 1 ,bd(C i ),G 2 )$i. 
Program P k+ i is the program (P k - {C}) U {D 1 , ... , D m }. 

Rule PD4 (Atomic Folding). Let C be a renamed apart clause of P^ of the form: H <— Gi,A$, G2, 
where: (i) A is a non-basic atom, and (ii) 1? is a substitution, and let D be an atomic definition clause 
in Defsk of the form: iV <— A. By folding C w.r.t. A$ using D we derive the non-basic atom N$ and 
we derive the clause E : H <— Gi,N"&, G 2 - 
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Program Pk+i is the program (P& — {C}) U {E}. 

The partial deduction of a program P may be realized by applying the atomic definition intro- 
duction, definition elimination, unfolding, and atomic folding rules, according to the so called partial 
deduction strategy which we will describe below. Our partial deduction strategy uses two subsidiary 
strategies: (1) an Unfold strategy, which derives new sets of clauses by repeatedly applying the unfold- 
ing rule, and (2) a Define-Fold strategy, which introduces new atomic definition clauses and it folds the 
clauses derived by the Unfold strategy. These subsidiary strategies use an unfolding selection function 
and a generalization function, which we now define. Let us first introduce the following notation: 

(i) NBAtoms is the set of all non-basic atoms, (ii) Clauses is the set of all clauses, (iii) Clauses* is 
the set of all finite sequences of clauses, (iv) V (Clauses) is the powerset of Clauses, (v) a sequence of 
clauses is denoted by C\, . . . ,C n , and (vi) the empty sequence of clauses is denoted by (). 

An unfolding selection function is a total function Select : Clauses* x Clauses — > NBAtomsL){halt}, 
where halt is a symbol not occurring in NBAtoms. We assume that, for C\, . . . , C n G Clauses* and 
C G Clauses, Select((C\, . . . , C n ), C) is a non-basic atom in the body of C. 

When applying the Unfold strategy the Select function is used as follows. During the unfolding 
process starting from a set Cls of clauses, we consider a clause, say C, to be unfolded, and the 
sequence of its ancestor clauses, that is, the sequence C\, . . . ,C n of clauses such that: (i) C\ G Cls, 

(ii) for k = 1, ... , n— 1, Ck+i is derived by unfolding Ck, and (iii) C is derived by unfolding C n . Now, 
(i) if S elect ((C i, . . . , C n ), C) = A, where A is a non-basic atom in the body of C, then C is unfolded 
w.r.t. A, and (ii) if Select((C\, . . . , C n ),C) = halt then C is not unfolded. 

A generalization function Gen : V(Clauses) x NBAtoms — > Clauses is defined for any set Defs 
of atomic definition clauses and for any non-basic atom A. Gen(Defs, A) is either a clause in Defs 
or a clause of the form g(X\, . . . ,Xh) <— GenA, where: (i) {X\, . . . ,Xh} = vars(GenA), (ii) A is an 
instance of GenA, and (iii) g is a new predicate, that is, it occurs neither in P nor in Defs. 

When applying the Define-Fold strategy the generalization function Gen is used as follows: when 
we want to fold a clause C w.r.t. a non-basic atom A in its body, we consider the set Defs of all 
atomic definition clauses introduced so far and we apply the folding rule using Gen(Defs,A). This 
application of the folding rule is indeed possible because, by construction, A is an instance of the body 
of Gen{Defs,A). 



Partial Deduction Strategy 

Input: A program P and a non-basic atom p(t\, . . . ,th) w.r.t. which we want to specialize P. 
Output: A program P p d and a non-basic atom Ppd(Xi, . . . , X r ), such that: (i) {X\, . . . ,X r } = 
vars{p(t\, . . . , th)), and (ii) for every ground substitution $ = {X\/ui, . . . , X r /u r }, 

M(P) \=p(t 1} ...,t h )0 iff M(P pd ) \= Ppd (X 1} ...,X r )#. 

Initialize: Let S be the clause p p d(Xi, . . . , X r ) <— p{t\, . . . , th). Let Ancestors(S) be the empty 
sequence of clauses. 

TransfP := P; Defs := {S}; Cls := {S}; 
while Cls / i do 

(1) Unfold: 

while there exists a clause C G Cls with Select(Ancestors(C),C) ^ halt do 

Let Unf(C) = {E \ E is derived by unfolding C w.r.t. Select (Ancestors (C) ,C)} . 
Cls := {Cls - {C})U Unf(C); 

for each E G Unf(C) let Ancestors(E) be the sequence Ancestors(C) followed by C 
end-while; 

(2) Define-Fold: 

NewDefs := 0; 

while there exists a clause C G Cls and there exists a non-basic atom A G bd(C) which has not 
been derived by folding do 

Let G be the atomic definition clause Gen(Defs, A) and F be the clause derived by folding 
C w.r.t. A using G. 
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Cls := (Cls - {C}) U {F}; 

if G & Defs then (Defs := Defs U {G}; NewDefs := NewDefs U {G}) 
end-while; 

TransfP := TransfP U Cls; Cls := NewDefs 
end-while; 

We derive the final program P p d by applying the definition elimination rule and keeping only the 
clauses of TransfP on which p p & depends. 

A given unfolding selection function Select is said to be progressive iff for the empty sequence () 
of clauses and for any clause C whose body contains at least one non-basic atom, we have that 
Select((),C) ^ halt. 

We have the following correctness result which is a straightforward corollary of Theorem El of 
Section Ol 

Theorem 3 (Correctness of Partial Deduction w.r.t. the Declarative Semantics) 

Let Select be a progressive unfolding selection function. Given a program P and a non-basic atom 
p(t\, . . . ,th), if the partial deduction strategy using Select terminates with output program P pc i and 
output atom p p( i(X\, . . . , X r ), then for every ground substitution •& = {Xi/m, . . . , X r /u r }, 
M(P)\=p(t 1 ,...,t h )diSM(P pd )\=p pd (X 1 ,...,X r )0. 

We say that an unfolding selection function Select is halting iff for any infinite sequence C%, C2, . . . 
of clauses, there exists n > such that Select ((Ci, C%, . . . , C n ), C n +i) = halt. 

Given an infinite sequence A\, A2, ■ ■ ■ of non-basic atoms, its image under the generalization func- 
tion Gen, is the sequence of sets of clauses defined as follows: 

Gi = { 

newp(Xi, . . . , X n ) < — A\}, where {A'i, . . . , X n } — vars (A±) 
G l+1 = G i U{Gen(G l ,A i+1 )} for i > 1. 

We say that Gen is stabilizing iff for any infinite sequence A\ , A2, ■ ■ ■ of non-basic atoms whose 
image under Gen is G%, G2, ■ ■ ■ , there exists n > such that Gk = G n for all k > n. 
We have the following theorem whose proof is similar to the one in . 

Theorem 4 (Termination of Partial Deduction) Let Select be a halting unfolding selection func- 
tion and Gen be a stabilizing generalization function. Then for any input program P and non-basic 
atom p(t\, . . . , i/J, the partial deduction strategy using Select and Gen terminates. 

The following example shows that the unfolding rule (and thus, the partial deduction strategy) is 
not correct w.r.t. the operational semantics. 

Example 1 Let us consider the following program P\ : 

1. p^X^a, q(X) 

2. q(b)<- 

By unfolding clause 1 w.r.t. q(X) we derive the following program P%: 

3. p <— b^a 
2. q(b)<- 

We have that the goal p does not succeed in Pi , while it succeeds in P2 . 

We will address this correctness issue in detail in Sectional where we will present a set of trans- 
formation rules which are correct w.r.t. the operational semantics for the class of safe programs (see 
Theorem EJ) . 
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3.2 An Example of Partial Deduction: String Matching 

In this section we illustrate the partial deduction strategy by means of a well-known program special- 
ization example which consists in specializing a general string matching program w.r.t. a given pattern 
(see [111 1131 144j for a similar example) . Given a program for searching a pattern in a string, and a 
fixed ground pattern p, we want to derive a new, specialized program for searching the pattern p in a 
given string. Now we present a general program, called Match, for searching a pattern P in a string 
S in {a, b}* . Strings in {a, b}* are denoted by lists of a's and b's. This program is deterministic for 
atoms of the form match(P, S), where P and S are ground lists. 



Program Match 


(initial, deterministic) 


1. match(P, S) <- matchl{P, S, P, S) 




2. matchl([ },S,Y,Z) <- 




3. mafcM([C|P],[C|S],y,Z) <- matchl(P, S,Y,Z) 




4. mafcftl([o|P],[6|5] ) Y;[C|^]) «- matchlOT,Z,Y,Z) 




5. mateM([6|P],[a S],y,[C|Z]) <- matchl(Y,Z,Y t Z) 





Let us assume that we want to specialize this program Match w.r.t. the goal match ([a, a, b], S), that 
is, we want to derive a program which tells us whether or not the pattern [a, a, b] occurs in the string 
S. 

We apply our partial deduction strategy using the following unfolding selection function DetU and 
generalization function Variant. 

(1) The function DetU : Clauses* x Clauses — > NBAtoms U {/iaZi} is defined as follows: 

(1) DetU ((), C) = A if A is the leftmost non-basic atom in the body of clause C, 

(ii) DetU ((Ci, C2, . . . , C n ), C) = A if n > 1 and A is the leftmost non-basic atom the body of C such 
that A is unifiable with at most one clause head in the program to be partially evaluated, and 
(hi) DetU({C\, C%, . . . , C n ), C) = halt if there exists no non-basic atom in the body of C which is 
unifiable with at most one clause head in the program to be partially evaluated. 

(2) The function Variant : V(Clauses) x NBAtoms — » Clauses is defined as follows: 

(i) Variant (Defs, A) is a clause C such that bal(C) is a variant of A, if in Defs there exists any such 
clause C, and 

(ii) Variant(Defs, A) is the clause newp(Xi, . . . , Xh) <— A, where newp is a new predicate symbol and 
{Xi, . . . , Xh} = vars(A), otherwise. 

The function DetU corresponds to the determinate unfolding rule considered in ^21- We have that 
DetU is not halting and Variant is not stabilizing. Nevertheless, in our example, as the reader may 
verify, the partial deduction strategy using DetU and Variant terminates and generates the following 
specialized program: 



Pro 


Ijram Match p d 


(specialized by partial deduction, deterministic) 


6. 


matchpd(S) <— 


newl(S) 




7. 


newl([a|5]) <— 


new2(S) 




8. 


newl([b\S}) <- 


newl(S) 




9. 


new2([a\S}) <- 


new3(S) 




10. 


new2([b\S]) <- 


newl(S) 




11. 


new3([6 S]) «- 






12. 


neu>3([a 5"]) <— 


new3(S) 





The program Match p( i is deterministic for atoms of the form match p d(S), where S is a ground list, and 
it corresponds to a DFA in the sense that: (i) each predicate corresponds to a state, (ii) each clause, 
except for clause 6 and 11, corresponds to a transition from the state corresponding to the predicate 
of the head to the state corresponding to the predicate of the body, (hi) each transition is labelled by 
the symbol (either a or b) occurring in the head of the corresponding clause, (iv) by clause 6 we have 
that newl is the initial state for goals of the form match p d(w), where w is any ground list representing 



9 



a word in {a, b}* , and (v) clause 11 corresponds to a transition, labeled by b, to an unnamed final 
state where any remaining portion of the input word is accepted. 

Thus, via partial deduction we may derive a DFA from a deterministic string matching program, 
and the derived program corresponds to the Knuth- Morris-Pratt string matching algorithm |22| . 

3.3 Some Limitations of Partial Deduction 

The fact that the partial deduction strategy derives a DFA is a consequence of the fact that the 
initial string matching program Match is rather sophisticated and, indeed, the correctness proof of the 
program Match is not straightforward. Actually, the partial deduction strategy does not derive a DFA 
if we consider, instead of the program Match, the following naive initial program for string matching: 



Pro 


gram Naive-Match 


(initial, nondeterministic) 


1. 


naive _match(P, S) <— append (X, R, S), append(L, P, X) 




2. 


append([ ],Y,Y) <— 




3. 


append ([A\X],Y,[A\Z]) <- append(X,Y, Z) 





This program is nondeterministic for atoms of the form naive -match(P, S), where P and S are ground 
lists. The correctness of this naive program is straightforward because for a given pattern P and a 
string S, Naive-Match tests whether or not P occurs in S by looking in a nondeterministic way for 
two strings L and R such that S is the concatenation of L, P, and R in this order. 

The reader may verify that the partial deduction strategy does not derive a DFA when starting from 
the program Naive-Match. Indeed, if we specialize Naive-Match w.r.t. the goal naive -match ([a, a, b], S) 
by applying the partial deduction strategy using the unfolding selection function DetU and the gen- 
eralization function Variant, then we derive the following program Naive -Matchpd which does not 
correspond to a DFA and it is nondeterministic: 



Pro 


gram Naive -Matchpd 


(specialized by partial deduction, nondeterministic) 


4. 


naive -match p( i{S) <— 


newl(X,R,S), new2(L,X) 


5. 


newl{[ ],Y,Y) «- 




6. 


newl([A\X],Y,[A\Z}) 


<- newl(X,Y,Z) 


7. 


new2([ ], [a, a, b]) <— 




8. 


new2([A\X],[A\Z]) <- 


new2{X, Z) 



Indeed, this Naive -Matchpd program looks in a nondeterministic way for two strings L and R such 
that S is the concatenation of L, [a, a, b], and R. If the pattern [a, a, b] is not found within the string 
S at a given position, then the search for [a, a, b] is restarted after a shift of one character to the right 
of that position. 

From the program Naive-Match we can derive a specialized program which is much more efficient 
than Naive-Matchpd by applying conjunctive partial deduction, instead of partial deduction. Conjunc- 
tive partial deduction, viewed as a sequence of applications of transformation rules, enhances partial 
deduction because: (i) one may introduce a definition clause whose body is a conjunction of atoms, 
instead of one atom only (see rule PD1), and (ii) one may fold a clause w.r.t. a conjunction of atoms 
in its body, instead of one atom only (see rule PD4). By applying conjunctive partial deduction one 
may avoid intermediate data structures, such as the list X constructed by using clause 1 of program 
Naive-Match. Indeed, by using the ECCE system for conjunctive partial deduction |2l], from the 
Naive-Match program we derive the following specialized program: 

Program Naive -Matched (specialized by conjunctive partial deduction, nondeterministic) 

9. naive-match cp d([X,Y, Z\S]) <— new 1 ( X, Y, Z, S) 

10. newl(a, a, b, S) <— 

11. newl(X, Y, Z, [C\S]) <- newl(Y, Z, C, S) 

This Naive -Match C pd program searches for the pattern [a, a, b] in the input string by looking at the 
first three elements of that string. If they are a, a, and b, in this order, then the search succeeds, 
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otherwise the search for the pattern continues in the tail of the string. Although this Naive -Matched 
program is much more efficient than the initial Naive-Match program, it does not correspond to a 
DFA because, when searching for the pattern [a,a,b], it looks at a prefix of length 3 of the input 
string, instead of one symbol only. 

The failure of partial deduction and conjunctive partial deduction to derive a DFA when starting 
from the Naive-Match program, is due to some limitations which can be overcome by using the 
enhanced transformation rules we will present in the next section. By applying these enhanced rules 
we can define a new predicate by introducing several clauses whose bodies are non-atomic goals, while 
by applying the rules for partial deduction or conjunctive partial deduction, a new predicate can be 
defined by introducing one clause only. By folding using definition clauses of the enhanced form, we 
can derive specialized programs where nondeterminism is reduced and intermediate data structures 
are avoided. Among our enhanced rules we also have the so called case split rule which, given a 
clause, produces two mutually exclusive instances of that clause by introducing negated equations. 
The application of this rule allows subsequent folding steps which reduce nondeterminism. 

By applying the enhanced transformation rules according to the Determinization Strategy we will 
present in Section|HJ one can automatically specialize the nondeterministic program Naive-Match w.r.t. 
the goal naive -match ([a, a, b], 5) thereby deriving the following deterministic program (this derivation 
is not presented here and it is similar to the one presented in Section [7.1(1 : 

Program Naive-Match s (specialized by Determinization, deterministic) 



12. 


naive -match S {S) <— newl(S) 


13. 


newl([a\S]) <- 


- new2(S) 


14. 


newl{[C\S]) * 


— Cj^a, newl(S) 


15. 


new2([a\S]) <- 


- new3(S) 


16. 


new2([C\S]) * 


— C^a, newl(S) 


17. 


new3([b\S]) *- 


- newA(S) 


18. 


new3([a 5]) <- 


- new3(S) 


19. 


new3([C\S}) * 


- C^b,C^a,newl{S) 


20. 


newA(S) <— 





The program Naive-Match s corresponds in a straightforward way to a DFA. Moreover, since the clauses 
of Naive-Match s are pairwise mutually exclusive, the disequations in their bodies can be dropped in 
favor of cuts (or equivalently, if-then-else constructs) as follows: 



Program Naive-Match cu t 


(specialized, with cuts) 


21. naive -match S {S) <— newl(S) 




22. newl([a\S]) <- 


-!, new2(S) 




23. newl([C\S}) <- 


- newl(S) 




24. new2([a\S]) ±- 


-!, new3(S) 




25. new2([C\S]) <- 


- new 1(5) 




26. new3([b\S}) «- 


!, new4(S) 




27. new3([a\S}) «- 


-!, new3(S) 




28. new3([C\S}) <- 


- new 1(5) 




29. newA(S) <- 







Computer experiments confirm that the final Naive-Match cu t program is indeed more efficient than 
the Naive-Match, Naive-Match^, and Naive -Match cp d programs. In Section [7| we will present more 
experimental results which demonstrate that the specialized programs derived by our technique are 
more efficient than those derived by partial deduction or conjunctive partial deduction. 
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4 Transformation Rules for Logic Programs with Equations and Dis- 
equations between Terms 



In this section we present the program transformation rules which we use for program specialization. 
These rules extend the unfold /fold rules considered in [141 14U1 146j to logic programs with atoms which 
denote equations and disequations between terms. The transformation rules we present in this section 
enhance in several respects the rules PD1-PD4 for partial deduction which we have considered in 
Section |3 In particular, we consider a definition introduction rule (see Rule ^) which allows the 
introduction of new predicates defined by several clauses whose bodies are non-atomic goals, while by 
rule PD1 a new predicate can be defined by introducing one clause whose body is an atomic goal. 
We also consider a folding rule (see Rule by which we can fold several clauses at a time, while 
by rule PD4 we can fold one clause only. In addition, we consider the subsumption rule and the 
following transformation rules for introducing and eliminating equations and disequations: (i) head 
generalization, (ii) case split, (hi) equation elimination, and (iv) disequation replacement. Our rules 
preserve the least Herbrand model as indicated in Theorem El below. 

4.1 Transformation Rules 

Similarly to Section |3J the process of program transformation is viewed as a transformation sequence 
constructed by applying some transformation rules. However, as already mentioned, in this section we 
consider an enhanced set of transformation rules. A transformation sequence Pq, . . . , P n is constructed 
from a given initial program Pq by applications of the transformation rules ECU given below, as follows. 
For k = 0, ... ,n — 1, program P k +\ is derived from program P k by: (i) selecting a (possibly empty) 
subset 71 of clauses of Pk, (ii) deriving a set 72 of clauses by applying a transformation rule to 71, and 
(hi) replacing 71 by 72 in P k . 

Notice that rules 13 and El are in fact equal to rules PD2 and PD3, respectively. However, we rewrite 
them below for the reader's convenience. 

Rule 1 (Definition Introduction) We introduce m (> 1) new clauses, called definition clauses, of 
the form: 



where: (i) newp is a non-basic predicate symbol not occurring in Pq, . . . ,P k , (ii) the variables X\, . . . , X^ 
are all distinct and for all i G {1, . . . , h} there exists j G {1, . . . , to} such that Xi occurs in the goal 
Bodyj, (hi) for all j G {1, . . . ,m}, every non-basic predicate occurring in Bodyj also occurs in Pq, and 
(iv) for all j G {1, . . . , m}, there exists at least one non-basic atom in Bodyj. 
Program Pk+i is the program P k U {D\, . . . , D m }. 

As in Section |3J we denote by Defs k the set of definition clauses introduced by the definition 
introduction rule during the construction of the transformation sequence Pq, . . . , Pk- In particular, we 
have that Defs = 0. 

Rule 2 (Definition Elimination) Let p be a predicate symbol. By definition elimination w.r.t.p 
we derive the program Pk+i = {C G P k \ p depends on C}. 

Rule 3 (Unfolding) Let C be a renamed apart clause of Pk of the form: H <— G\, A, G2, where A 
is a non-basic atom. Let C\, . . . , C m , with m > 0, be the clauses of Pk such that, for i = 1, . . . , m, A 
is unifiable with the head of Cj via the mgu By unfolding C w.r.t. A, for i = 1, . . . ,m, we derive 
the clause A : (H <- G 1 ,bd{C i ),G 2 )^ i . 
Program P k+1 is the program (P k - {C}) U {D 1 , ... , D m }. 

Notice that an application of the unfolding rule to clause C amounts to the deletion of C iff m = 0. 
Sometimes in the literature this particular instance of the unfolding rule is treated as an extra rule. 
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Rule 4 (Folding) Let 

d. H <- G 1 ,Body 1 $,G 2 

C m .H^ G 1 ,Body m ti,G 2 
be renamed clauses of P k , for a suitable substitution i?, and let 

{D 1 . newp(Xx,... , X h ) <- Body x 
D m . newp(X 1 , X h ) <- 5o(iy m 

be all clauses in Defs k which have newp as head predicate. Suppose that for % = 1, . . . , m, the following 
condition holds: for every variable X occurring in the goal Bodyi and not in {X\, . . . , Xh}, we have 
that: (i) X'Q is a variable which does not occur in (H, G\, G 2 ), and (ii) Xfi does not occur in Vi?, for 
any variable Y occurring in Bodyi and different from X. By folding C\, . . . , C m using D\ } . . . , D m we 
derive the single clause E: H <— G\, newp{X\, . . . , Xh)^, G 2 . 
Program P k+1 is the program (P k - {C\, . . . , C m }) U {E}. 

For instance, the clauses C\: p(X) *— q(t(X),Y),r(Y) and C 2 - p(X) <— s(X),r(Y) can be folded 
(by considering the substitution i? = {U/X,V/Y}) using the two definition clauses D\\ a(U,V) <— 
q(t(U),V) andD 2 : a(U,V) <- s(U), and we replace C x and C 2 by the clause E: p(X) <- a(X,Y),r(Y). 

Rule 5 (Subsumption) (i) Given a substitution we say that a clause H <^ G\ subsumes a clause 
(ff<-Gi,G 2 )i?. 

Program -Pfc+i is derived from program P k by deleting a clause which is subsumed by another clause 
in P k . 

Rule 6 (Head Generalization) Let C be a clause of the form: H{X/t} <— 5ody in P^, where 
{X/t} is a substitution such that X occurs in H and X does not occur in C. By /lead generalization, 
we derive the clause GenC: H <— X=t, Body. 
Program P k +i is the program (P k — {C}) U {GenC}. 

Rule is a particular case of the rule of generalization + equality introduction considered, for 
instance, in |38j . 

Rule 7 (Case Split) Let C be a clause in P k of the form: H <— Body. By case sp/ii of C w.r.t. the 
binding X/i where X does not occur in t, we derive the following two clauses: 

C x . (H <- Body){X/t} 
C 2 . H *- X^t,Body. 

Program Pfc+i is the program {P k — {C}) U {C\, C 2 }. 

In this RuleEl we do not assume that X occurs in C. However, in the Determinization Strategy of 
SectionH3 we will always apply the case split rule to a clause C : H <— Body w.r.t. a binding X/t where 
X occurs in H. This use of the case split rule will be sufficient to derive mutually exclusive clauses. 
Indeed, according to our operational semantics, if G i — > p fe+1 G\ using clause C\ and X occurs in H, 
then no G 2 exists such that G i — >p k+1 G 2 using clause C 2 . The same holds by interchanging C% and 
C 2 . We will return to this property in Definitions |H1 (Semideterminism) and ^] (Mutual Exclusion) 
below. 

Rule 8 (Equation Elimination) Let C\ be a clause in P k of the form: 
C\. H <— G\, t x =t 2 , G 2 

If t\ and t 2 are unifiable via the most general unifier then by equation elimination we derive the 
following clause: 

C 2 . (H<-Gi,G 2 )# 
Program P k +i is the program (P k — {C x }) U {C 2 }. 

If t\ and t 2 are not unifiable then by equation elimination we derive program Pfc+i which is P k — {Ci}. 
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Rule 9 (Disequation Replacement) Let C be a clause in program P^. Program Pk+i is derived 
from Pfc by either removing C or replacing C as we now indicate: 

01 if C is of the form: H <— G\,t\^t%,G% and t\ and £2 are not unifiable, then C is replaced by 
H < — G\ , G2 

EJ2 if C is of the form: H <— Gi, f(t\, ■ ■ ■ ,t m ) ^ f(u±, . . . ,u m ), G2, then C is replaced by the 
following m (> 0) clauses: H <— G\, t% ^U\, G2, H <— G\, t m ^u m , G2 

|Hj3 if C is of the form: H <— G\, X ^X, G2, then C is removed from P^ 

034 if C is of the form: H <- G u t^X, G 2 , then C is replaced by H *- Gh.,X^t,G 2 

El 5 if C is of the form: .PT <— G±,X ^tx,G 2 ,X ^t2,Gs and there exists a substitution p which is 
a bijective mapping from the set of the local variables of X ^ t\ in C onto the set of the local 
variables of X^t2 in C such that t\p = t2, then C is replaced by <— Gi, X^ti, G2, G3. 

In particular, by Rule 05, if a disequation has occurs twice in the body of a clause, then we can 
remove the rightmost occurrence. 

4.2 Correctness of the Transformation Rules w.r.t. the Declarative Semantics 

In this section we show that, under suitable hypotheses, our transformation rules preserve the declar- 
ative semantics presented in Section 12.21 In that sense we also say that our transformation rules 
are correct) w.r.t. the given declarative semantics. The following correctness theorem extends simi- 
lar results holding for logic programs j!41 1401 146j to the case of logic programs with equations and 
disequations. 

Theorem 5 (Correctness of the Rules w.r.t. the Declarative Semantics) Let Po,--- ,P n be 
a transformation sequence constructed by using the transformation rules an d let p be a non- 
basic predicate in P n . Let us assume that: 

1. if the folding rule is applied for the derivation of a clause C in program Pk+i from clauses 
C\ , . . . , C m in program P}, using clauses D\ , . . . , D m in Defs^ , with < k < n, 

then for every i € {1, . . . , m} there exists j G {1, . . . , n— 1} such that Di occurs in Pj and Pj+i 
is derived from Pj by unfolding Di. 

2. during the transformation sequence Po,...,P n the definition elimination rule either is never 
applied or it is applied w.r.t. predicate p once only, in the last step, that is, when deriving P n 
from P n _i. 

Then, for every ground atom A with predicate p, we have that M(Pq U Defs n ) \= A iff M{P n ) \= A. 

Proof: It is a simple extension of a similar result presented in ^1] for the case where we use the 
unfolding, folding, and generalization + equality introduction rules. The proof technique used in Jl] 
can be adapted to prove also the correctness of our extended set of rules. □ 

In Example ^ of Section |31 we have shown that the unfolding rule may not preserve the operational 
semantics. The following examples show that also other transformation rules may not preserve the 
operational semantics. 

Example 2 Let us consider the following program Pi : 

1. p(X)<-q(X), X + a 

2. q(X)<- 

3. q(X)^X = b 

By rule El we may delete clause 3 which is subsumed by clause 2 and we derive a new program P2 ■ 
Now, we have that p(X) succeeds in Pi, while it does not succeed in P2. 
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Example 3 Let us consider the following program P3 : 

1. p(X)<- 

By the case split rule we may replace clause 1 by the two clauses: 

2. p(a)«- 

3. p(X)^X + a 

and we derive a new program P4. The goal p(X), X = b succeeds in P3, while it does not succeed in 
Pa- 

Example 4 Let us consider the following program P5 : 

1. p^X^a, X = b 

By rule |S] we may replace clause 1 by: 

2. p^b^a 

and we derive a new program Pg. The goal p does not succeed in P5, while it succeeds in Pq. 

Finally, let us consider the following two operations on the body of a clause: (i) removal of a duplicate 
atom, and (ii) reordering of atoms. The following examples show that these two operations, which 
preserve the declarative semantics, may not preserve the operational semantics. Notice, however, that 
the removal of a duplicate atom and the reordering of atoms cannot be accomplished by the trans- 
formation rules listed in Section |3J except for special case considered at Point 05 of the disequation 
replacement rule. 

Example 5 Let us consider the program P7: 

1. p^q(X,Y), q(X,Y), X + Y 

2. q(X,b)<- 

3. q(a,Y)<- 

and the program P^ obtained from P7 by replacing clause 1 by the following clause: 

4. p^q(X,Y), X^Y 

The goal p succeeds in P7, while it does not succeed in P$. Indeed, (i) for program P7 we have that: 
p 1 — >p 7 q(X,Y), q(X,Y), X y^Y 1 — >p T q(X,b),X^b 1 — >p 7 a^b 1 — >p 7 true, and (ii) for program 
Ps we have that: either p 1 — >p g X^b or p 1 — >p g a^Y. In Case (ii), since X and Y are unifiable 
with b and a, respectively, we have that p 1 — >p s true does not hold. 

Example 6 Let us consider the program P9: 

1. p<-q{X), r(X) 

2. q(a)<- 

3. r(X)^X^b 

and the program P10 obtained from Pg by replacing clause 1 by the following clause: 

4. p^r(X), q(X) 

The goal p succeeds in Pg, while it does not succeed in Piq. 

In the next section we will introduce a class of programs and a class of goals for which our transfor- 
mation rules preserve both the declarative semantics and the operational semantics. In order to do 
so, we associate a mode with every predicate. A mode of a predicate specifies the input arguments of 
that predicate, and we assume that whenever the predicate is called, its input arguments are bound to 
ground terms. We will see that, if some suitable conditions are satisfied, compliance to modes guar- 
antees the preservation of the operational semantics. This fact is illustrated by the above Examples [21 
and El and indeed, in each of them, if we restrict ourselves to calls of the predicate p with ground 
arguments, then the initial program and the derived program have the same operational semantics. 

Notice, however, that the incorrectness of the transformation of Example 0] does not depend on 
the modes. Thus, in order to ensure correctness w.r.t. the operational semantics we have to rule out 
clauses such as clause 1 of program P5. Indeed, as we will see in the next section, the clauses we will 
consider satisfy the following condition: each variable which occurs in a disequation either occurs in 
an input argument of the head predicate or it is a local variable of the disequation. 
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5 Program Transformations based on Modes 



Modes provide information about the directionality of predicates, by specifying whether an argument 
should be used as input or output (see, for instance, [321 HH1)- Mode information is very useful for 
specifying and verifying logic programs [21 E3 an d it is used in existing compilers, such as Ciao and 
Mercury, to generate very efficient code [191 145j . Mode information has also been used in the context 
of program transformation to provide sufficient conditions which ensure that reorderings of atoms in 
the body of a clause preserve program termination [3]. 

In this paper we use mode information for: (i) specifying classes of programs and goals w.r.t. which 
the transformation rules we have presented in Section 14,11 preserve the operational semantics (see 
Section f2.3j) . and (ii) designing our strategy for specializing programs and reducing nondeterminism. 

5.1 Modes 

A mode for a non-basic predicate p of arity h (> 0) is an expression of the form p(mi,... , m^), 
where for i = l,...,h, mi is either + (denoting any ground term) or ? (denoting any term). In 
particular, if h = 0, then p has a unique mode which is p itself. Given an atom p(ti, . . . , i/J and a 
mode p(mi , . . . , rrih) , 

(1) for i = 1, . . . , h, the term t{ is said to be an input argument of p iff rrtj is +, and 

(2) a variable of p(t±, . . . ,th) with an occurrence in an input argument of p, is said to be an input 
variable of p(ti, . . . , i/,). 

A mode for a program P is a set of modes for non-basic predicates containing exactly one mode 
for every distinct, non-basic predicate p occurring in P. 

Notice that a mode for a program P may or may not contain modes for non-basic predicates which 
do not occur in P. Thus, if M is a mode for a program P\ and, by applying a transformation rule, 
from P\ we derive a new program Pi where all occurrences of a predicate have been eliminated, then 
M is a mode also for P<i- The following rules may eliminate occurrences of predicates: definition 
elimination, unfolding, folding, subsumption, disequation replacement (case 05). Clearly, if from P\ 
we derive Pi by applying the definition introduction rule, then in order to obtain a mode for P2 we 
should add to M a mode for the newly introduced predicate (unless it is already in M). 

Example 7 Given the program P: 

p(0,l) <- 
p(0,Y)^q(Y) 

the set Mi = ?), <?(?)} is a mode for P. M2 = {p(+, ?),<?(+), r(+)} is a different mode for P. 

Definition 2 Let M be a mode for a program P and p a non-basic predicate. We say that an atom 
p(ti, . . . , th) satisfies the mode M iff (1) a mode for p belongs to M and (2) for i = 1, . . . , h, if the 
argument t$ is an input argument of p according to M, then £j is a ground term. In particular, when 
h = 0, we have that p satisfies M iff p € M. 

The program P satisfies the mode M iff for each non-basic atom ^4o which satisfies M, and for each 
non-basic atom A and goal G such that Aq i — >p (A, G), we have that A satisfies M. 

With reference to Example [7| above, program P satisfies mode M\, but it does not satisfy mode M%. 

In general, the property that a program satisfies a mode is undecidable. Two approaches are 
usually followed for verifying this property: (i) the first one uses abstract interpretation methods (see, 
for instance, [13132]) which always terminate, but may return a don't know answer, and (ii) the second 
one checks suitable syntactic properties of the program at hand, such as well-modedness [2], which 
imply that the mode is satisfied. 

Our technique is independent of any specific method used for verifying that a program satisfies 
a mode. However, as the reader may verify, all programs presented in the examples of Sectional are 
well-moded and, thus, they satisfy the given modes. 
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5.2 Correctness of the Transformation Rules w.r.t. the Operational Semantics 

Now we introduce a class of programs, called safe programs, and we prove that if the transformation 
rules are applied to a safe program and suitable restrictions hold, then the given program and the 
derived program are equivalent w.r.t. the operational semantics. 

Definition 3 (Safe Programs) Let M be a mode for a program P. We say that a clause C in P 
is safe w.r.t. M iff for each disequation t x ^ ti in the body of C, we have that: for each variable X 
occurring in t x 7^ ti either X is an input variable of hd{C) or X is a local variable of t x ^ ti in C . 
Program P is safe w.r.t. M iff all its clauses are safe w.r.t. M. 

For instance, let us consider the mode M = {p(+), q(?)}. Clause p(X) <— X^f{Y) is safe w.r.t. M 
and clause p(X) <— X^ f(Y), q(Y) is not safe w.r.t. M because Y occurs both in f(Y) and in q(Y). 

When mentioning the safety property w.r.t. a given mode M, we feel free to omit the reference to 
M, if it is irrelevant or understood from the context. 

In order to get our desired correctness result (see Theorem El below), we need to restrict the use of 
our transformation rules as indicated in Definitions 14171 below. In particular, these restrictions ensure 
that, by applying the transformation rules, program safety and mode satisfaction are preserved (see 
Propositions El and 0] in Appendix A). 

Definition 4 (Safe Unfolding) Let P^ be a program and M be a mode for P/.. Let us consider an 
application of the unfolding rule (see Rule |3] in Section 14. lj) whereby from the following clause of P^ : 

H <— G\, A, G 2 
we derive the clauses: 

' D x . (H ^G 1 ,bd(C 1 ),G 2 )#i 

< ■ ■ ■ 

_ D m . (H*-G 1} bd(C m ),G 2 )^ m 

where Ci, . . . , C m are the clauses in P^ such that, for i £ {1, ... , m}, A is unifiable with the head of 
Ci via the mgu 

We say that this application of the unfolding rule is safe w.r.t. mode M iff for all i = 1, . . . , m, for all 
disequations d in bd(Ci), and for all variables X occurring in di?,, we have that either X is an input 
variable of H$i or X is a local variable of d in Cj. 

To see that unrestricted applications of the unfolding rule may not preserve safety, let us consider 
the following program: 

1. p <-<?(*), r(X) 

2. g(l)<- 

3. r(X)^X^0 

and the mode M = {p, #(?), for it. By unfolding clause 1 w.r.t. the atom r(X) we derive the 

clause: 

4. p^q(X), X^O 

This clause is not safe w.r.t. M because X does not occur in its head. 

Definition 5 (Safe Folding) Let us consider a program P^ and a mode M for P^. Let us also 
consider an application of the folding rule (see Rule |1] in Section 14. 1|) whereby from the following 
clauses in P&: 

d.H*- G 1 ,(A 1 ,K 1 )^,G 2 

C m . H <— Gi,(A m ,K m )'d,G2 
and the following definition clauses in Defs^- 

D 1 . newp(Xi,. . .,X h ) <- A X ,K X 

< ■ ■ ■ 

D m . newp(X x , . . . , X h ) <- A m , K m 
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we derive the new clause: 

H<-G 1 ,newp(X 1 ,...,X h )0,G 2 
We say that this application of the folding rule is safe w.r.t. mode M iff the following Property E 
holds: 

(Property E) Each input variable of newp(X\, . . . , Xh)^ is also an input variable of at least one of 
the non-basic atoms occurring in (H, G±, A±i9, . . . , A m #). 

Unrestricted applications of the folding rule may not preserve modes. Indeed, let us consider the 
following initial program: 

1. p^q(X) 

2. g(l)<- 

Suppose that first we introduce the definition clause: 

3. new(X) «- q(X) 

and then we apply the clause split rule, thereby deriving: 

4. neiu(O) «- g(0) 

5. new(X) «- X^O, q(X) 

The program made out of clauses 1, 2, 4, and 5 satisfies the mode M = {p, <?(?), new(+)}. By folding 
clause 1 using clause 3 we derive: 

6. p <— new(X) 

This application of the folding rule is not safe and the program we have derived, consisting of clauses 
2, 4, 5, and 6, does not satisfy M. 

Definition 6 (Safe Head Generalization) Let us consider a program and a mode M for P^. 
We say that an application of the head generalization rule (see Rule El in Section l4.1|) to a clause of 
Pfc is safe iff X is not an input variable w.r.t. M. 

The restrictions considered in Definition El are needed to preserve safety. For instance, the clause 
p(t(X)) ^X^O is safe w.r.t. the mode M = {p{+)}, while p{Y) <- Y = t(X),X^0 is not. 

Definition 7 (Safe Case Split) Let us consider a program and a mode M for P^. Let us consider 
also an application of the case split rule (see Rule in Section f4.1j) whereby from a clause C in P^ of 
the form: H <— Body we derive the following two clauses: 

C x . (H <— Body){X/t] 
C 2 . H <- X^t,Body. 

We say that this application of the case split rule is safe w.r.t. mode M iff X is an input variable of 
H, X does not occur in t, and for all variables Y G vars(t), either Y is an input variable of H orY 
does not occur in C. 

When applying the safe case split rule, X occurs in H and thus, given a goal G, it is not the case 
that for some goals G\ and G 2 , we have both G i — ► Gi using clause C\ and G i — > G*2 using clause 
Ci. In Definition 1121 below, we will formalize this property by saying that the clauses C\ and C 2 are 
mutually exclusive. 

Similarly to the unfolding and head generalization rules, the unrestricted use of the case split 
rule may not preserve safety. For instance, from the clause p(X) <— which is safe w.r.t. the mode 
M = {f?(?)}, we may derive the two clauses p(0) <— and p(X) <— X j^O, and this last clause is not safe 
w.r.t. M. 

We have shown in Section 14.11 (see Example |SJ), that the reordering of atoms in the body of a 
clause may not preserve the operational semantics. Now we prove that a particular reordering of 
atoms, called disequation promotion, which consists in moving to the left the disequations occurring in 
the body of a safe clause, preserves the operational semantics. Disequation promotion (not included, 
for reason of simplicity, among the transformation rules) allows us to rewrite the body of a safe clause 
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so that every disequation occurs to the left of every atom different from a disequation thereby deriving 
the normal form of that clause (see Section |HJ) . The use of normal forms will simplify the proof of 
Theorem H3 below and the presentation of the Determinization Strategy in Section H3 

Proposition 1 (Correctness of Disequation Promotion) Let M be a mode for a program P\. 
Let us assume that Pi is safe w.r.t. M and Pi satisfies M. Let C\\ H <— G\, G2, t\ 7^ t%, G3 
be a clause in Pi. Let P2 be the program derived from Pi by replacing clause C\ by clause C2' 
H <— Gi, ti 7^ G2, G3. Then: (i) P2 is safe w.r.t. M, (ii) P2 satisfies M, and (iii) for each 
non-basic atom A which satisfies mode M, A succeeds in Pi iff A succeeds in P2. 

Proof: Point (i) follows from the fact that safety does not depend on the position of the disequation in 
a clause. Moreover, the evaluation of goal G2 in program Pi according our operational semantics, does 
not bind any variable in t\ 7^2; and thus, we get Point (ii). Point (iii) is a consequence of Points (i) 
and (ii) and the fact that the evaluation of t\ 7^ £2 does not bind any variable in the goals G2 and G3 . 
□ 

The above proposition does not hold if we interchange clause C\ and C2. Consider, in fact, the 
following clause which is safe w.r.t. mode M = {p(+), <?(+)}: 
G 3 . p(X)^X^Y, q(Z) 

This clause satisfies M because for all derivations starting from a ground instance p{t) of p{X) the 
atom ty^Y does not succeed. In contrast, if we use the clause C4: p(X) <— q(Z), X y^Y, we have that 
in the derivation starting from p(t), the variable Z is not bound to a ground term and thus, clause C4 
does not satisfy the mode M which has the element (/(+). 

In Theorem below we will show that if we apply our transformation rules and their safe versions 
in a restricted way, then a program P which satisfies a mode M and is safe w.r.t. M, is transformed 
into a new program, say Q, which satisfies M and is safe w.r.t. M. Moreover, the programs P and Q 
have the same operational semantics. 

Theorem 6 (Correctness of the Rules w.r.t. the Operational Semantics) Let Po,...,P n be 
a transformation sequence constructed by using the transformation rules an d let p be a non-basic 
predicate in P n . Let M be a mode for PoL)Defs n such that: (i) PoUPe/s n is safe w.r.t. M, (ii) PoL)Defs n 
satisfies M, and (iii) the applications of the unfolding, folding, head generalization, and case split rules 
during the construction of Po, . . . ,P n are all safe w.r.t. M. Suppose also that Conditions ^ and [2] of 
Theorem hold. Then: (i) P n is safe w.r.t. M, (ii) P n satisfies M, and (iii) for each atom A which 
has predicate p and satisfies mode M, A succeeds in Po U Defs n iff A succeeds in P n . 

Proof: See Appendix A. □ 
5.3 Semideterministic Programs 

In this section we introduce the concept of semideterminism which characterizes the class of programs 
which can be obtained by using the Determinization Strategy of Section EJ (The reader should not 
confuse the notion of semideterminism presented here with the one considered in |18|.) 

We have already noticed that if a program P is deterministic for an atom A according to Defini- 
tion^ then there is at most one successful derivation starting from A, and A succeeds in P with at 
most one answer substitution. Thus, if an atom succeeds in a program with more than one answer 
substitution, and none of these substitutions is more general than another, then there is no chance to 
transform that program into a new program which is deterministic for that atom. 

For instance, let us consider the following generalization of the problem of Sections 13.21 and 
Given a pattern P and a string S we want to compute the position, say N, of an occurrence of P in 
S, that is, we want to find two strings L and R such that: (i) S is the concatenation of L, P, and R, 
and (ii) the length of L is N. The following program Match_Pos computes N for any given P and S: 
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The Match-Pos program is nondeterniinistic for atoms of the form match_pos(P, S, N) where P and 
S are ground lists, and it computes one answer substitution for each occurrence of P in S. 

Suppose that we want to specialize Match-Pos w.r.t. the atom match-pos([a,a,b], S, N). Thus, 
we want to derive a new, specialized program Match_Pos s and a new binary predicate match-pos s . 
This new program should be able to compute multiple answer substitutions for a goal. For instance, 
for the atom match_pos s ([a,a,b,a,a,b], N) the program Match-Pos s should compute the two sub- 
stitutions {N/0} and {N/ s(s(s(0)))} and, thus, Match_Pos s cannot be deterministic for the atom 
match_pos s ([a, o, b, a, a, b],N). 

Now, in order to deal with programs which may return multiple answer substitutions, we introduce 
the notion of semideterminism, which is weaker than that of determinism. Informally, we may say 
that a semideterministic program has the minimum amount of nondeterminism which is needed to 
compute multiple answer substitutions. In Section H3 we will prove that the Determinization Strategy, 
if it terminates, derives a semideterministic program. 

Definition 8 (Semideterminism) A program P is semideterministic for a non-basic atom A iff for 
each goal G such that A =^>* p G, there exists at most one clause C such that G =^c G' for some goal 
G' different from true. 

Given a mode M for a program P, we say that P is semideterministic w.r.t. M iff P is semideterministic 
for each non-basic atom which satisfies M. 

We will show in Section [7.1 1 that by applying the Determinization Strategy, from Match_Pos s we 
derive the following specialized program Match-Pos s which is semideterministic for atoms of the form 
match_pos s (S, N), where S is a ground list. 

Program Match_Pos s (specialized, semideterministic) 

9. match-pos s (S, N) <— newl(S, N) 

20. newl([a\S],M) <- new2(S,M) 

21. newl([C\S], s(N)) <- C^a, newl{S,N) 

32. new2([a\S], M) <- new3(S,M) 

33. new2([C\S],s(s(N))) <- C^a, newl(S,N) 

46. new3([a\S], s(M)) <- new3(R, S) 

47. new3([b\S], M) «- new4(R, S) 

48. new3([C\S],s(s(s(N)))) «- C^a, newl(£,iV) 

49. neu;4(5, 0) <- 

55. new4([a|S],s(s(s(M)))) <- new2(S,M) 

56. new4([C|5],s(s(s(s(JV))))) <- C^a, newl(5,iV) 

Now we give a simple sufficient condition which ensures semideterminism. It is based on the concept 
of mutually exclusive clauses which we introduce below. We need some preliminary definitions. 

Definition 9 (Satisfiability of Disequations w.r.t. a Set of Variables) Given a set V of vari- 
ables, we say that a conjunction D of disequations, is satisfiable w.r.t. V iff there exists a ground 
substitution a with domain V, such that every ground instance of Da holds (see Section |2.2|) , In 
particular, D is satisfiable w.r.t. iff every ground instance of D holds. 

The satisfiability of a conjunction D of disequations w.r.t. a given set V of variables, can be checked 
by using the following algorithm defined by structural induction: 
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(1) true, i.e., the empty conjunction of disequations, is satisfiable w.r.t. V, 

(2) (D\, D 2 ) is satisfiable w.r.t. V iff both D\ and D 2 are satisfiable w.r.t. V, 

(3) X/t is satisfiable w.r.t. V iff X occurs in V and t is either a non-variable term or a variable 
occurring in V distinct from X, 

{A) t ^ X is satisfiable w.r.t. V iff X^t is satisfiable w.r.t. V, 

(5) /(. . .) 7^ g(. . .), where / and g are distinct function symbols, is satisfiable w.r.t. V, and 

(6) f(t\, . . . , t m ) ^ • • • , M m ) is satisfiable w.r.t. V iff at least one disequation among t\^u\, . . . , 
tm^Um is satisfiable w.r.t. V. 

The correctness of this algorithm relies on the fact that the set of function symbols is infinite (see 
Section EOT). 

Definition 10 (Linearity) A program P is said to be linear iff every clause of P has at most one 
non-basic atom in its body. 

Definition 11 (Guard of a Clause) The guard of a clause C, denoted grd(C), is bd(C) if all atoms 
in bd(C) are disequations, otherwise grd(C) is the (possibly empty) conjunction of the disequations 
occurring in bd(C) to the left of the leftmost atom which is not a disequation. 

Definition 12 (Mutually Exclusive Clauses) Let us consider a mode M for the following two, 
renamed apart clauses: 

C\. p(£i,ui) <- G\ 
C 2 . p{t 2 ,u 2 ) <- G 2 

where: (i) p is a predicate of arity k (>0) whose first h arguments, with 0<h<k, are input arguments 
according to M, (ii) t\ and £2 are /i-tuples of terms denoting the input arguments of p, and (iii) u\ 
and u 2 are (k — /i)-tuples of terms. 

We say that Ci and C 2 are mutually exclusive w.r.t. mode M iff either (i) t± is not unifiable with t 2 or 
(ii) ii and t 2 are unifiable via an mgu i? and (grd(Cx), grd{C 2 ))'& is not satisfiable w.r.t. vars(ti,t 2 ). 
If /i = we stipulate that the empty tuples ii and £2 are unifiable via an mgu which is the identity 
substitution. 

The following proposition is useful for proving that a program is semideterministic. 

Proposition 2 (Sufficient Condition for Semideterminism) If (i) P is a linear program, (ii) P 
is safe w.r.t. a given mode M, (iii) P satisfies M, and (iv) the non-unit clauses of P are pairwise 
mutually exclusive w.r.t. M, then P is semideterministic w.r.t. M. 

Proof: See Appendix B. □ 

In Section we will present a strategy for deriving specialized programs which satisfies the hy- 
potheses (i)-(iv) of the above Proposition |2 and thus, these derived programs are semideterministic. 
The following examples show that in Proposition |2] no hypothesis on program P can be discarded. 

Example 8 Consider the following program P and the mode M = {p, q} for P: 

1. p<- q, q 

2. q^ 

3. q «- q 

P is not linear, but P is safe w.r.t. M and P satisfies M. The non-unit clauses of P which are 
the clauses 1 and 3, are pairwise mutually exclusive. However, P is not semideterministic w.r.t. M, 
because p 1 — >* P (q,q), and there exist two non-basic goals, namely q and (q,q), such that (q,q) =^p q 
and (q,q) ^ P (q,q). 
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Example 9 Consider the following program Q and the mode M = {f>(?), qx, 92} for Q: 

1. p(X)^X^0, ft 

2. p(l)<-g2 

Q is linear and it satisfies M, but Q is not safe w.r.t. M because X is not an input variable of p. Clauses 
1 and 2 are mutually exclusive w.r.t. M, because the set of input variables in p(X) is empty and X ^0 
is not satisfiable w.r.t. 0. However, Q is not semideterministic w.r.t. M, because p(l) 1 — >q p(1)> and 
there exist two non-basic goals, namely q\ and g 2 , such that p(l) =^q q± and p(l) =4>q c/ 2 . 

Example 10 Consider the following program R and the mode M = {p, ri, for it!: 

1. p^r(X) 

2. r(l)<-ri 

3. r(2) <- r 2 

i? is linear and safe w.r.t. M, but R does not satisfy M, because p 1 — >r r(X) and X is not a ground 
term. Clauses 1, 2, and 3 are pairwise mutually exclusive. However, R is not semideterministic w.r.t. 
M, because p > — >* R r(X) and there exist two non-basic goals, namely r\ and r 2 , such that r(X) =>r r\ 
and r(X) =4>r r%. 

Example 11 Consider the following program S and the mode M = {p, ri,r2} for S: 

1. p <— n 

2. p <— r 2 

iS is linear and safe w.r.t. M, and S 1 satisfies M. Clauses 1 and 2 are not pairwise mutually exclusive. 

5 is not semideterministic w.r.t. M, because p 1 — p, and there exist two non-basic goals, namely r± 
and r 2 , such that p =^5 r\ and p =^5 r 2 . 

We conclude this section by observing that when a program consists of mutually exclusive clauses 
and, thus, it is semideterministic, it may be executed very efficiently on standard Prolog systems by 
inserting cuts in a suitable way. We will return to this point in Section |H] when we discuss the speedups 
obtained by our specialization technique. 

6 A Transformation Strategy for Specializing Programs and Reduc- 
ing Nondeterminism 

In this section we present a strategy, called Determinization, for guiding the application of the trans- 
formation rules presented in Section 14.11 Our strategy pursues the following objectives. (1) The 
specialization of a program w.r.t. a particular goal. This is similar to what partial deduction does. 
(2) The elimination of multiple or intermediate data structures. This is similar to what the strategies 
for eliminating unnecessary variables |38| and conjunctive partial deduction do. (3) The reduction 
of nondeterminism. This is accomplished by deriving programs whose non-unit clauses are mutually 
exclusive w.r.t. a given mode, that is, by Proposition |5J semideterministic programs. 

The Determinization Strategy is based upon three subsidiary strategies: (i) the Unfold- Simplify 
subsidiary strategy, which uses the safe unfolding, equation elimination, disequation replacement, 
and subsumption rules, (ii) the Partition subsidiary strategy, which uses the safe case split, equation 
elimination, disequation replacement, subsumption, and safe head generalization rules, and (iii) the 
Define-Fold subsidiary strategy which uses the definition introduction and safe folding rules. For rea- 
sons of clarity, during the presentation of the Determinization Strategy we use high-level descriptions 
of the subsidiary strategies. These descriptions are used to establish the correctness of Determinization 
(see Theorem EJ. Full details of the subsidiary strategies will be given in Sections 16.21 16.31 and 16.41 
respectively. 

6.1 The Determinization Strategy 

Given an initial program P, a mode M for P, and an atom p(t±, . . . ,th) w.r.t. which we want to 
specialize P, we introduce by the definition introduction rule, the clause 
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S: p s (X 1 , X r ) <- p(ti, ...,t h ) 
where Xi, . . . , X r are the distinct variables occurring in p[t\, . . . , th). 

We also define a mode p s {m\, . . . , m r ) for the predicate p s by stipulating that, for any j = 1, . . . , r, 
rrij is + iff Xj is an input variable of p(t±, . . . ,th) according to the mode M. We assume that the 
program P is safe w.r.t. M. Thus, also program PUjS 1 } is safe w.r.t. MU{p s (mx, . . . , m r )}. We also 
assume that P satisfies mode M and thus, program P U {S} satisfies mode M U {p a (m\, . . . , m r )}. 

Our Determinization Strategy is presented below as an iterative procedure that, at each iteration, 
manipulates the following three sets of clauses: (1) TransfP, which is the set of clauses from which 
we will construct the specialized program, (2) Defs, which is the set of clauses introduced by the 
definition introduction rule, and (3) Cls, which is the set of clauses to be transformed during the 
current iteration. Initially, Cls consists of the single clause S: p s {X\, . . . , X r ) <— p{t\, . . . , th) which is 
constructed as we have indicated above. 

The Determinization Strategy starts off each iteration by applying the Unfold-Simplify subsidiary 
strategy to the set Cls, thereby deriving a new set of clauses called UnfoldedCls. The Unfold-Simplify 
strategy first unfolds the clauses in Cls, and then it simplifies the derived set of clauses by applying 
the equation elimination, disequation replacement, and subsumption rules. 

Then the set UnfoldedCls is divided into two sets: (i) UnitCls, which is the set of unit clauses, 
and (ii) NonunitCls, which is the set of non-unit clauses. The Determinization Strategy proceeds by 
applying the Partition subsidiary strategy to NonunitCls, thereby deriving a new set of clauses called 
PartitionedCls. The Partition strategy consists of suitable applications of the case split, equation 
elimination, disequation replacement, and head generalization rules such that the set PartitionedCls 
has the following property: it can be partitioned into sets of clauses, called packets, such that two 
clauses taken from different packets are mutually exclusive (w.r.t. a suitable mode). 

The Determinization Strategy continues by applying the Define-Fold subsidiary strategy to the 
clauses in PartitionedCls, thereby deriving a new, semideterministic set of clauses called FoldedCls. 
The Define-Fold subsidiary strategy introduces a (possibly empty) set NewDefs of definition clauses 
such that each packet can be folded into a single clause by using a set of definition clauses in Defs U 
NewDefs. We have that clauses derived by folding different packets are mutually exclusive and, thus, 
UnitCls U FoldedCls is semideterministic. 

At the end of each iteration, UnitCls U FoldedCls is added to TransfP, NewDefs is added to Defs, 
and the value of the set Cls is updated to NewDefs. 

The Determinization Strategy terminates when Cls = 0, that is, no new predicate is introduced 
during the current iteration. 

Determinization Strategy 

Input: A program P, an atom p{t\, ... ,tfr) w.r.t. which we want to specialize P, and a mode M for 
P such that P is safe w.r.t. M and P satisfies M. 

Output: A specialized program P s , and an atomp s (A"i,. . . , X r ), with {X±,. . . , X r } = vars(p{t\,. . . , th)) 
such that: (i) for every ground substitution i? = {X\/u\, . . . ,X r /u r }, M(P) \= p(t\, . . . ,th)$ iff 
M(P S ) \= p s (Xi, . . . , X r )$, and (ii) for every substitution a = {X\/vi, . . . , X r /v r } such that the atom 
p{t\, . . . , th)cr satisfies mode M, we have that: (ii. 1) p(t±, . . . , th)cr succeeds in P iff p s (X\, . . . , X r )a 
succeeds in P s , and (ii.2) P s is semideterministic for p s (X\, . . . ,X r )a. 

Initialize: Let S be the clause p s {X\, . . . , X r ) <— p(t\, . . . , th). 

TransfP := P; Defs := {S}; Cls := {S}; M s :=MU {p s (m\, . . . , m r )}, where for any j = 1, . . . , r, 
rrij = + iff Xj is an input variable of p{t\, . . . , th) according to the mode M; 

while Cls ^ do 

(1) Unfold-Simplify: 

We apply the safe unfolding, equation elimination, disequation replacement, and subsumption 
rules according to the Unfold-Simplify Strategy given in Section 16.21 below, and from Cls we 
derive a new set of clauses UnfoldedCls . 
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(2) Partition: 

Let UnitCls be the unit clauses occurring in UnfoldedCls, and NonunitCls be the set of non-unit 
clauses in UnfoldedCls . 

We apply the safe case split, equation elimination, disequation replacement, and safe head 
generalization rules according to the Partition Strategy given in Section 16.31 below, and from 
NonunitCls we derive a set PartitionedCls of clauses which is the union of disjoint subsets of 
clauses. Each subset is called a packet. The packets of PartitionedCls enjoy the following prop- 
erties: 

(2a) each packet is a set of clauses of the form (modulo renaming of variables): 
H <— Diseqs, G\ 

< 

H <— Diseqs, G m 

where Diseqs is a conjunction of disequations and for k = 1, . . . , m, no disequation occurs in G^, 
and 

(2b) for any two clauses C\ and C2, if the packet of C\ is different from the packet of C%, then 
C\ and C2 are mutually exclusive w.r.t. mode M s . 

(3) Define-Fold: 

We apply the definition introduction and the safe folding rules according to the Define-Fold 
subsidiary strategy given in Section 16.41 below. According to that strategy, we introduce a 
(possibly empty) set NewDefs of new definition clauses and a set M new of modes such that: 
(3a) in M new there exists exactly one mode for each distinct head predicate in NewDefs, and 
(3b) from each packet in PartitionedCls we derive a single clause of the form: 
H <— Diseqs, newp(. . .) 

by an application of the folding rule, which is safe w.r.t. M new , using the clauses in Defs U 
NewDefs. 

Let FoldedCls be the set of clauses derived by folding the packets in PartitionedCls. 

(4) TransfP := TransfP U UnitCls U FoldedCls; Defs := Defs U NewDefs; Cls := NewDefs; 
M s := M s U M new 

end-while 

We derive the specialized program P s by applying the definition elimination rule and keeping only the 
clauses of TransfP on which p s depends. 



The Determinization Strategy may fail to terminate for two reasons: (i) the Unfold-Simplify subsidiary 
strategy may not terminate, because it may perform infinitely many unfolding steps, and (ii) the 
condition Cls 7^ for exiting the while-do loop may always be false, because at each iteration the 
Define-Fold subsidiary strategy may introduce new definition clauses. We will discuss these issues in 
more detail in Section El 

Now we show that, if the Determinization Strategy terminates, then the least Herbrand model and 
the operational semantics are preserved. Moreover, the derived specialized program P s is semideter- 
ministic for p s {X\, . . . , X r )a as indicated by the following theorem. 

Theorem 7 (Correctness of the Determinization Strategy) Let us consider a program P, a 
non-basic atom p{t\, . . . ,th), and a mode M for P such that: (1) P is safe w.r.t. M and (2) P 
satisfies M. If the Determinization Strategy terminates with output program P s and output atom 
p s (Xi, ... , X r ) where {Xi, ... , X r } = vars(p(t\, ... , t h )), then 

(i) for every ground substitution 1? = {X\/ui, . . . , X r /u r }, 

M(P) \= P (t 1} ...,t h )^ iff M{P S ) \= Pa {X u ... t X r )# and 
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(ii) for every substitution a = {Xi/vi, . . . , X r /v r } such that the atom p(ti, . . . , th)cr satisfies mode 
M, 

(11.1) p(t\, . . . , th)o succeeds in P iff p s (X\, . . . , X r )a succeeds in P s , and 

(11.2) P s is semideterministic for p s (Xi, . . . , X r )a. 

Proof: Let Defs and P s be the set of definition clauses and the specialized program obtained at the 
end of the Determinization Strategy. 

(i) Since p s (Xi, . . . , X r ) <— p(t\, . . . ,th) is the only clause for p s in P U Defs and {X\, . . . ,X r } = 
vars(p(ti, . . . ,th)), for every ground substitution $ = {Xi/u\, . . . , X r /u r } we have that M(P) \= 
p(ti, . . . , iff M{P U Defs) \= p s (Xi, . . . ,X r )-d. By the correctness of the transformation rules 
w.r.t. the least Herbrand model (see Theorem EJ, we have that M{P U Defs) \= p s (X±, . . . ,X r )-d iff 
M(P s )\= Ps (X l ,...,X r )^. 

Point (ii.l) follows from Theorem H3 because during the Determinization Strategy, each application of 
the unfolding, folding, head generalization, and case split rule is safe. 

(ii.2) We first observe that, by construction, for every substitution a, the atom p(ti, . . . ,th)cr satisfies 
mode M iff p s (X\, . . . , X r )a satisfies mode M s , where M s is the mode obtained from M at the end of 
the Determinization Strategy. Thus, Point (ii.2) can be shown by proving that P s is semideterministic 
w.r.t. M s . In order to prove this fact, it is enough to prove that TransfP w — P is semideterministic 
w.r.t. M s , where TransfP w is the set of clauses which is the value of the variable TransfP at the end of 
the while-do statement of the Determinization Strategy. Indeed, P s is equal to TransfP w —P because, 
by construction, p s does not depend on any clause of P, and thus, by the final application of the 
definition elimination rule, all clauses of P are removed from TransfP w . 

By Proposition EJ it is enough to prove that: (a) TransfP W —P is linear, (b) TransfP W —P is safe 
w.r.t. M s , (c) TransfP w — P satisfies M s , and (d) the non-unit clauses of TransfP W —P are pairwise 
mutually exclusive w.r.t. M s . 

Property (a) holds because according to the Determinization Strategy, after every application of 
the safe folding rule we get a clause of the form: H <— Diseqs, newp(. . .), where a single non-basic 
atom occurs in the body. All other clauses in TransfP W — P are unit clauses. 

Properties (b) and (c) follow from Theorem recalling that the application of the unfolding, 
folding, head generalization, and case split rules are all safe. 

Property (d) can be proved by showing that, during the execution of the Determinization Strat- 
egy, the following Property (I) holds: all the non-unit clauses of TransfP — P are pairwise mutually 
exclusive w.r.t. M s . Indeed, initially TransfP— P is empty and thus, Property (I) holds. Furthermore, 
Property (I) is an invariant of the while-do loop. Indeed, at the end of each execution of the body of 
the while-do (see Point (4) of the strategy) , the non-unit clauses which are added to the current value 
of TransfP are the elements of the set FoldedCls and those non-unit clauses are derived by applying 
the Partition and Define- Fold subsidiary strategies at Points (3) and (4), respectively. By construc- 
tion, the clauses in FoldedCls are pairwise mutually exclusive w.r.t. M new , and their head predicates 
do not occur in TransfP. Thus, the clauses of TransfP U UnitCls U FoldedCls are pairwise mutually 
exclusive w.r.t. M s UM new . As a consequence, after the two assignments (see Point (4) of the strategy) 
TransfP := TransfP U UnitCls U FoldedCls and M s := M s U M new , we have that Property (I) holds. □ 

Now we describe the three subsidiary strategies for realizing the Unfold-Simplify, Partition, and 
Define-Fold transformations as specified by the Determinization Strategy. We will see these subsidiary 
strategies in action in the examples of Section 

During the application of our subsidiary strategies it will be convenient to rewrite every safe clause 
into its normal form. The normal form iV of a safe clause can be constructed by performing disequation 
replacements and disequation promotions, so that the following Properties N1-N5 hold: 

(Nl) every disequation is of the form: X ^t, with t different from X and unifiable with X, 
(N2) every disequation occurs in bd(N) to the left of every atom different from a disequation, 
(N3) if X occurs in bd(N) and both X and Y are input variables of hd(N), then in hd(N) the 
leftmost occurrence of X is to the left of the leftmost occurrence of Y, 
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(N4) for every disequation of the form X ^ Y where Y is an input variable, we have that also X is an 
input variable, and 

(N5) for any pair of disequations d\ and e?2 in bd{N), it does not exist a substitution p which is a 
bijective mapping from the set of the local variables of d\ in N onto the set of the local variables of 
c?2 in N such that d\p = di- 

We have that: (i) the normal form of a safe clause is unique, modulo renaming of variables and 
disequation promotion, (ii) no two equal disequations occur in the normal form of a safe clause, and 
(hi) given a program P and a mode M for P such that P is safe w.r.t. M and P satisfies M, if we 
rewrite a clause of P into its normal form, then the least Herbrand model semantics and the operational 
semantics are preserved (this fact is a consequence of Theorem |SJ Theorem H3 and Proposition 

A safe clause for which Properties N1-N5 hold, is said to be in normal form. If a clause C is in 
normal form, then by Property N2, every disequation in bd{C) occurs also in grd{C). 

6.2 The Unfold-Simplify Subsidiary Strategy 

The Unfold-Simplify strategy first unfolds the clauses in Cls w.r.t. the leftmost atom in their body, 
and then it keeps unfolding the derived clauses as long as input variables are not instantiated. Now, in 
order to give the formal definition of the Unfold-Simplify strategy we introduce the following concept. 

Definition 13 (Consumer Atom) Let P be a program and M a mode for P. A non-basic atom 
q(ti, . . . , tk) is said to be a consumer atom iff for every non-unit clause in P whose head unifies with 
that non-basic atom via an mgu we have that for i = 1, . . . , k, if ti is an input argument of q then 
ijt? is a variant of t{. 

The Unfold-Simplify strategy is realized by the following Unfold-Simplify procedure, where the 
expression Simplify(S) denotes the set of clauses derived from a given set S of clauses by: (1) first, 
applying whenever possible the equation elimination rule to the clauses in S, (2) then, rewriting the 
derived clauses into their normal form, and (3) finally, applying as long as possible the subsumption 
rule. 



Procedure Unfold- Simplify(Cls, UnfoldedCls). 

Input: A set Cls of clauses in a program P and a mode M s for P. P is safe w.r.t. M s and for each 
C € Cls, the input variables of the leftmost non-basic atom in the body of C are input variables of 
the head of C. 

Output: A new set UnfoldedCls of clauses which are derived from Cls by applying the safe unfolding, 
equation elimination, disequation replacement, and subsumption rules. The clauses in UnfoldedCls 
are safe w.r.t. M s . 

(1) Unfold w.r.t. Leftmost Non-basic Atom: 

UnfoldedCls := {E | there exists a clause C G Cls and clause E is derived by unfolding C w.r.t. 

the leftmost non-basic atom in its body}; 
UnfoldedCls : = Simplify {UnfoldedCls) 

(2) Unfold w.r.t. Leftmost Consumer Atom: 

while there exists a clause C € UnfoldedCls whose body has a leftmost consumer atom, say A, 
such that the unfolding of C w.r.t. A is safe do 

UnfoldedCls := {UnfoldedCls - {C}) U {E \ E is derived by unfolding C w.r.t. A}; 

UnfoldedCls := Simplify {UnfoldedCls) 

end-while 

Notice that our assumptions on the input program P and clauses Cls ensure that the first unfolding 
step performed by the Unfold-Simplify procedure is safe. 

Notice also that our Unfold-Simplify strategy may fail to terminate. We will briefly return to this 
issue in Section El 

Our Unfold-Simplify strategy differs from usual unfolding strategies for (conjunctive) partial de- 
duction (see, for instance, [El El EH1 E] ) , because mode information is used. We have found this 
strategy very effective on several examples as shown in the following Section [7| 
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6.3 The Partition Subsidiary Strategy 

The Partition strategy is realized by the following procedure, where we will write p(t, u) to denote an 
atom with non-basic predicate p of arity k (> 0) , such that: (i) t is an /i-tuple of terms, with 0<h<k, 
denoting the h input arguments of p, and (ii) u is a (k — /i)-tuple of terms denoting the arguments of 
p which are not input arguments. 



Procedure Partition(NonunitCls , PartitionedCls). 

Input: A set NonunitCls of non-unit clauses in normal form and without variables in common. A 
mode M s for NonunitCls. The clauses in NonunitCls are safe w.r.t. M s . 

Output: A set PartitionedCls of clauses which is the union of disjoint packets of clauses such that: 
(2a) each packet is a set of clauses of the form (modulo renaming of variables): 

!H <— Diseqs, G\ 
H <— Diseqs, G m 

where Diseqs is a conjunction of disequations and for k = 1, . . . , m, no disequation occurs in G^, and 
(2b) for any two clauses C\ and C 2 , if the packet of C\ is different from the packet of C 2 , then C\ and 
C2 are mutually exclusive w.r.t. mode M s . 

The clauses in PartitionedCls are in normal form and they are safe w.r.t. M s . 

while there exist in NonunitCls two clauses of the form: 

C x . p(ti,iti) <- Body x 
C 2 . p(t 2 ,u 2 ) <- Body 2 

such that: (i) C\ and C 2 are not mutually exclusive w.r.t. mode M s , and either 

(11.1) t\ is not a variant of t 2 or 

(11. 2) t\ is a variant of £2 via an mgu $ such that ti"$ = t2, and for any substitution p which is a bijective 
mapping from the set of local variables of grd(C\-&) in Ci$ onto the set of local variables of grd(C 2 ) 
in C 2 , grd(C\{)p) cannot be made syntactically equal to grd(C 2 ) by applying disequation promotion 
do 

We take a binding X/r as follows. 

(Case 1) Suppose that t\ is not a variant of t 2 . In this case, since C\ and C 2 are not mutually 
exclusive, we have that t\ and t 2 are unifiable and, for some i,j € {1, 2}, with iy^j, there exists 
an mgu d of U and ij and a binding y/i in 1? such that tj{Y/t a } is not a variant of tj. Without 
loss of generality we may assume that i = 1 and j = 2. Then we take the binding X/r to be y/t a . 

(Case 2) Suppose that t\ is a variant of £2 via an mgu 1?. Now every safe clause whose normal 
form has a disequation of the form X 7^ t, where X is a local variable of that disequation in 
that clause, is mutually exclusive w.r.t. any other safe clause. This is the case because, for any 
substitution a which does not bind X, to is unifiable with X and, thus, X^to is not satisfiable. 
Thus, for some i,j € {1,2}, with ij^j, there exists a disequation (Yj^t a )'d in grd(Ci-d) where 
Y$ is an input variable of hd{Ci'd), such that for any substitution p which is a bijective mapping 
from the set of local variables of grd(Ci$) in Cji? onto the set of local variables of grdifij-ff) in 
Cj-Q and for every disequation (Z^t^-d in grd(Cj$), we have that (Y^t a )-&p is different from 
(Z^tb)^. We also have that Yfl is an input variable of hd(Cj-d). Without loss of generality 
we may assume that i = l, j = 2, t\ , & = t 2 , and C 2 d = C 2 . Then we take the binding X/r to be 
(Y/t a )$. 

We apply the case split rule to clause C 2 w.r.t. X/r, that is, we derive the two clauses: 

C21. (p(t 2 ,u 2 ) <- Sody 2 ){X/r} 
C 22 - p(t 2 ,u 2 ) <- X^r,Body 2 

We update the value of NonunitCls as follows: 

NonunitCls := (NonunitCls - {C 2 }) U {C 2 i, C 22 } 
NonunitCls := Simplify (NonunitCls). 
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end-while 



Now the set NonunitCls is partitioned into subsets of clauses and after suitable renaming of variables 
and disequation promotion, each subset is of the form: 

p(t,u±) <— Diseqs, Goal\ 

< 

p(t,u m ) <— Diseqs, Goal m 
where Diseqs is a conjunction of disequations and for k = 1, . . . , m, no disequation occurs in Goal^, 
and any two clauses in different subsets are mutually exclusive w.r.t. mode M s . 

Then we process every subset of clauses we have derived, by applying the safe head generalization 
rule so to replace the non-input arguments in the heads of the clauses belonging to the same subset 
by their most specific common generalization. Thus, every subset of clauses will eventually take the 
form: 

p(t, u) <— Eqs 1 , Diseqs, Goal\ 

< 

p(t, u) <— Eqs m , Diseqs, Goal m 

where u is the most specific common generalization of the terms u±, . . . , u m and, for k = 1, . . . , m, the 
goal Eqs k is a conjunction of the equations V\ = v\, . . . , V r = v r such that u{V\/v\, . . . , V r /v r } = ut- 

Finally, we move all disequations to the leftmost positions of the body of every clause whereby getting 
the set PartitionedCls. 



Notice that in the above procedure the application of the case split rule to clause C2 w.r.t. X/r is safe 
because: (i) clauses C\ and C2 are safe w.r.t. M s , (ii) X is an input variable of hd{C22) (recall that 
our choice of X/r in Case 2 ensures that X is an input variable of hd(C2)), and (iii) each variable in 
r is either an input variable of hd{C22) or a local variable of X^r in 622- Thus, clauses C21 and C22 
are safe w.r.t. mode M s and they are also mutually exclusive w.r.t. M s . 

The following property is particularly important for the mechanization of our Determinization 
Strategy. 

Theorem 8 The Partition procedure terminates. 

Proof: See Appendix C. □ 

When the Partition procedure terminates, it returns a set PartitionedCls of clauses which is the 
union of packets of clauses enjoying Properties (2a) and (2b) indicated in the Output specification of 
that procedure. These properties are a straightforward consequence of the termination condition of 
the while-do statement of that same procedure. 

6.4 The Define-Fold Subsidiary Strategy 

The Define-Fold strategy is realized by the following procedure. 

Procedure Define- Fold(PartitionedCls , Defs, NewDefs, FoldedCls). 

Input: (i) A mode M s , (ii) a set PartitionedCls of clauses which are safe w.r.t. M s , and (iii) a set 
Defs of definition clauses. PartitionedCls is the union of the disjoint packets of clauses computed by 
the Partition subsidiary strategy. 

Output: (i) A (possibly empty) set NewDefs of definition clauses, together with a mode M new con- 
sisting of exactly one mode for each distinct head predicate in NewDefs. For each C <G NewDefs, the 
input variables of the leftmost non-basic atom in the body of C are input variables of the head of C. 
(ii) A set FoldedCls of folded clauses. 

NewDefs := 0; M new := 0; FoldedCls := 0; 

while there exists in PartitionedCls a packet Q of the form: 
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{H <— Diseqs, G\ 
H <— Diseqs, G rn 

where Diseqs is a conjunction of disequations and for k = 1, . . . , m, no disequation occurs in Gk, 
do PartitionedCls := PartitionedCls — Q and apply the definition and safe folding rules as follows. 

(Case a) Let us suppose that the set Defs of the available definition clauses contains a subset of 
clauses of the form: 

newq(X 1 , . . . , X h ) <- G\ 

< 

newq(X 1 , . . .,X h ) <- G m 

such that: (i) they are all the clauses in Defs for predicate newq, (ii) X±, . . . ,Xh include every 
variable which occurs in one of the goals G%, . . . , G m and also occurs in one of the goals H, Diseqs 
(this property is needed for the correctness of folding, see Section f4.1|l . and (iii) for i = 1, . . . , h, if 
Xi is an input argument of newq then Xi is either an input variable of H (according to the given 
mode M s ) or an input variable of the leftmost non-basic atom of one of the goals Cxi, ... , G m . 
Then we fold the given packet and we get: 

FoldedCls := FoldedCls U {H <— Diseqs, newq(X\, . . . , Xh)} 

(Case (3) If in Defs there is no set of definition clauses satisfying the conditions described in Case (a), 
then we add to NewDefs the following clauses for a new predicate newr: 

!newr(X 1 , . . . ,X h ) <- Gi 
newr (X x , ... ,X h ) <- G m 

where, for i = 1, . . . , h, either (i) X{ occurs in one of the goals Cxi, ... , G m and also occurs in one 
of the goals H, Diseqs, or (ii) Xi is an input variable of the leftmost non-basic atom of one of 
the goals G\, . . . , G m . We add to M new the mode newr (mi, . . . , m^) such that for i = 1, . . . , h, 
rrii = + iff Xi is either an input variable of H or an input variable of the leftmost non-basic atom 
of one of the goals G\, . . . , G m . We then fold the packet under consideration and we get: 

FoldedCls := FoldedCls U {H <— Diseqs, newr(X\, . . . , Xh)} 
end-while 



Notice that the post-conditions on the set NewDefs which is derived by the Define- Fold procedure (see 
Point (i) of the Output of the procedure) , ensure the satisfaction of the pre-conditions on the set Cls 
which is an input of the Unfold-Simplify procedure. Indeed, recall that the set Cls is constructed during 
the Determinization Strategy by the assignment Cls := NewDefs. Recall also that these pre-conditions 
are needed to ensure that the first unfolding step performed by the Unfold-Simplify procedure is safe. 

Notice also that each application of the folding rule is safe (see Definition [5J) . This fact is implied 
in Case (a) by Condition (iii), and in Case (j3) by the definition of the mode for newr. 

Finally, notice that the Define-Fold procedure terminates. However, this procedure does not guar- 
antee the termination of the specialization process, because at each iteration of the while-do loop of the 
Determinization Strategy, the Define-Fold procedure may introduce a nonempty set of new definition 
clauses. We will briefly discuss this issue in Section 

7 Examples of Application of the Determinization Strategy 

In this section we will present some examples of program specialization where we will see in action 
our Determinization Strategy together with the Unfold-Simplify, Partition, and Define-Fold subsidiary 
strategies. 
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7.1 A Complete Derivation: Computing the Occurrences of a Pattern in a String 

We consider again the program Match-Pos of Section [221 The mode M for the program Match_Pos is 
{match _pos{+, +, ?), append(7, ?, +), length(+, ?)}. We leave it to the reader to verify that Match-Pos 
satisfies M. 

The derivation we will perform using the Determinization Strategy is more challenging than the 
ones presented in the literature (see, for instance, ^D^1E3E3IB]) because an occurrence of the 
pattern P in the string S is specified in the initial program (see clause 1) in a nondeterministic way 
by stipulating the existence of two substrings L and R such that S is the concatenation of L, P, and 
R. 

We want to specialize the Match-Pos program w.r.t. the atom match_pos([a, a, b], S, N). Thus, we 
first introduce the definition clause: 

6. match_pos s (S, N) <— match_pos([a, a, b], S, N) 

The mode of the new predicate is match_pos s (+, ?) because S is an input argument of match_pos and 
./V is not an input argument. Our transformation strategy starts off with the following initial values: 
Defs = Cls = {6}, TransfP = Match_Pos, and M s = M U {match_pos s (+, ?)}. 

First iteration 

Unfold- Simplify. By unfolding clause 6 w.r.t. the leftmost atom in its body we derive: 

7. match-pos s (S, N) <— append(Y, R, S), append(L,[a,a,b],Y), length(L,N) 

The body of clause 7 has no consumer atoms (notice that, for instance, the mgu of append(Y, R, S) and 
the head of clause 5 has the binding S*/[A|Z] where S is an input variable). Thus, the Unfold-Simplify 
subsidiary strategy terminates. We have: UnfoldedCls = {7}. 

Partition. NonunitCls is made out of clause 7 only, and thus, the Partition subsidiary strategy 
immediately terminates and produces a set PartitionedCls which consists of a single packet made out 
of clause 7. 

Define-Fold. In order to fold clause 7 in PartitionedCls, the Define-Fold subsidiary strategy introduces 
the following definition clause: 

8. newl(S, N) <— append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 
The mode of newl is newl(+, ?). By folding clause 7 using clause 8 we derive: 

9. match-pos s (S, N) <— newl(S,N) 

Thus, the first iteration of the Determinization Strategy terminates with Defs = {6,8}, Cls = {8}, 
TransfP = Match.Pos U {9}, and M S =MU {match_pos s (+, ?), newl(+, ?)}. 

Second iteration 

Unfold-Simplify. We follow the subsidiary strategy described in Section E21 and we first unfold clause 
8 in Cls w.r.t. the leftmost atom in its body. We get: 

10. newl(S, N) <— append(L, [a, a, b], [ ]), length(L,N) 

11. newl([C\S], N) <- append (Y, R, S), appendjL, [a,a,b], [C\Y]) , length(L,N) 

Now we unfold clauses 10 and 11 w.r.t. the leftmost consumer atom of their bodies (see the underlined 
atoms). The unfolding of clause 10 amounts to its deletion because the atom append(L, [a, a, b], [ ]) is 
not unifiable with any head in program Match_Pos. The unfolding of clause 11 yields two new clauses 
that are further unfolded according to the Unfold-Simplify subsidiary strategy. After some unfolding 
steps, we derive the following clauses: 

12. newl([a\S],0) <— append ([o, b], R, S) 

13. newl([C\S], s(N)) <— append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

Partition. We apply the safe case split rule to clause 13 w.r.t. to the binding C/a, because the input 
argument in the head of this clause is unifiable with the input argument in the head of clause 12 via 
the mgu {C/a}. We derive the following two clauses: 
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14. newl([a\S], s(N)) <— append (Y, R, S), append(L,[a, a,b],Y), length(L,N) 

15. newl([C\S],s(N)) ^C^a, append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

Now, the set of clauses derived so far by the Partition subsidiary strategy can be partitioned into two 
packets: the first one is made out of clauses 12 and 14, where the input argument of the head predicate 
is of the form [a\S], and the second one is made out of clause 15 only, where the input argument of 
the head predicate is of the form [C\S] with C^a. 

The Partition subsidiary strategy terminates by applying the safe head generalization rule to 
clauses 12 and 14, so to replace the second arguments in their heads by the most specific common 
generalization of those arguments, that is, a variable. We get the packet: 

16. newl([a\S],M) <- M = 0, append ([a, b], R, S) 

17. new 1 ([a \ S], M) <-M = s(N), append (Y, R, S), append(L,[a,a,b],Y), length{L,N) 

For the packet made out of clause 15 only, no application of the safe head generalization rule is 
performed. Thus, we have derived the set of clauses PartitionCls which is the union of the two packets 
{16,17} and {15}. 

Define-Fold. Since there is no set of definition clauses which can be used to fold the packet {16, 17}, 
we are in Case (a) of the Define-Fold subsidiary strategy. Thus, we introduce a new predicate new2 
as follows: 

18. new2(S,M) <- M = 0, append ([a, b], R, S) 

19. new2(S,M) <-M = s(N), append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

The mode of new2 is new2(+,7) because S is an input variable of the head of each clause of the 
corresponding packet. By folding clauses 16 and 17 using clauses 18 and 19 we derive the following 
clause: 

20. newl([a\S], M) «- new2(S, M) 

We then consider the packet made out of clause 15 only. This packet can be folded using clause 8 in 
Defs. Thus, we are in Case {(5) of the Define-Fold subsidiary strategy. By folding clause 15 we derive 
the following clause: 

21. newl([C\S],s(N)) <- C^a, newl(S,N) 
Thus, FoldedCls is the set {20,21}. 

After these folding steps we conclude the second iteration of the Determinization Strategy with 
the following assignments: Defs := Defs U {18,19}; Cls := {18,19}; TransfP := TransfP U {20,21}; 
M s := M s U{new2(+,?)}. 

Third iteration 

Unfold- Simplify. From Cls, that is, clauses 18 and 19, we derive the set UnfoldedCls made out of the 
following clauses: 

22. new2([a\S],0) <- append ( [b], R, S) 

23. new2([a\S], s(0)) <— append([a,b], R, S) 

24. new2([C\S], s(s(N))) <- append (Y,R,S), append(L,[a,a,b],Y), length(L,N) 

Partition. The set NonunitCls is identical to UnfoldedCls. From NonunitCls we derive the set Parti- 
tionedCls which is the union of two packets. The first packet consists of the following clauses: 

25. new2([a\S],M) ^ M = 0, append ([6], R, S) 

26. new2([a\S],M) «- M = s(0), append ([a, b], R, S) 

27. new2([a\S], M) <- M = s(s(N)), append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 
The second packet consists of the following clause only: 

28. new2([C\S], s(s(N))) <- C^a, append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

Define-Fold. We introduce the following definition clauses: 

29. new3(S,M) <- M = 0, append ( [b] ,R, S) 

30. new3{S,M) <- M = s(0), append ([a, b], R, S) 

31. new3(S,M) <- M = s(s(N)), append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 
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where the mode for new3 is neu>3(+, ?). By folding, from PartitionedCls we derive the following two 
clauses: 

32. new2([a\S],M) «- new3(S, M) 

33. new2([C\S],s(s(N))) <- C^a, newl(S,N) 

which constitute the set FoldedCls. 

The third iteration of the Determinization Strategy terminates with the following assignments: 
Defs := Defs U {29,30,31}; Cls := {29,30,31}; TransfP := TransJP U {32, 33}; M s := M s U 
{new3(+,?)}. 

Fourth iteration 

Unfold- Simplify. From Cfe we derive the new set UnfoldedCls made out of the following clauses: 

34. new3([b\S],0) ^ append([],R,S) 

35. new 3 ([a IS*], s(0)) <— append ( [6] , i?, 5") 

36. neiy3([a|5],s(s(0))) <- append ([o, 6], i?, 5) 

37. ne«;3([C|5],s(s(s(iV)))) «- append (Y, R, S) , append(L,[a,a,b],Y), length(L,N) 

Partition. The set NonunitCls is identical to UnfoldedCls. From NonunitCls we derive the new set 
PartitionedCls made out of the following clauses: 

38. new3([a\S],s(M)) <- M = 0, append ( [6] , i?, S") 

39. ne«;3([a|5],s(M)) <- M = s(0), append ([a, 6], 5) 

40. new3([a|S , ],s(M)) <- M = s(s(JV)), append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

41. new3([6|S],M) <- M = 0, append{[ ],R,S) 

42. new3([&|S],M) <- M = s(s(s(N))), append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

43. new3([C\S],s(s(s(N)))) ^C^a,C^b, append(Y, R, S), append(L, [a, a, b],Y), length(L, N) 

PartitionedCls consists of three packets: {38,39,40}, {41,42}, and {43}. 

Define-Fold. We introduce two new predicates by means of the following definition clauses: 

44. new4(S,M) <- M = 0, append ([ },R,S) 

45. newA(S,M) <- M = s(s(s(N))), append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

We now fold the clauses in PartitionedCls and we derive the set FoldedCls made out of the following 
clauses: 

46. new3([a\S], s(M)) <- neu>3(i?, S) 

47. new3([6|S], M) <- newA(R, S) 

48. new3([C|S],s(s(s(iV)))) «- C^a, C^b, newl(S,N) 

The fourth iteration terminates with the following assignments: Defs := Defs U {44,45}; Cls := 
{44,45}; TransfP := Trans/P U {46,47,48}; M s := M s U {new4(+, ?)}. 

Fifth iteration 

Unfold- Simplify. From Cls we derive the new set UnfoldedCls made out of the following clauses: 

49. new4(S,0) ^- 

50. new4([a\S],s(s(s(0)))) «- append([a, b],R, S) 

51. neu;4([C|5],s(s(s(s(JV))))) <- append(Y,R, S), append(L,[a,a,b],Y), length(L,N) 

Partition. The set NonunitCls is made out of clauses 50 and 51. From NonunitCls we derive the new 
set PartitionedCls made out of the following clauses: 

52. new4([a\S],s(s(s(M)))) ^ M = 0, append ([a, b], R, S) 

53. new4([a|S],s(s(s(M)))) <- M = s(N), append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

54. new4([C\S],s(s(s(s(N))))) <- C^a, append (Y, R, S), append(L,[a,a,b],Y), length(L,N) 

PartitionedCls consists of two packets: {52,53} and {54}. 

Define-Fold. We are able to perform all required folding steps without introducing new definition 
clauses (see Case (a) of the Define-Fold procedure). In particular, (i) we fold clauses 52 and 53 using 
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4t- return N 



true 



© 



^a, N:=N+4 



Figure 1: The finite automaton with counter N which corresponds to Match_Pos s . 



clauses 18 and 19, and (ii) we fold clause 54 using clause 8. Since no new definition is introduced, 
the set Cls is empty and the transformation strategy terminates. Our final specialized program is the 
program Match_Pos s shown in Section f5. 31 

The Match_Pos s program is semideterministic and it corresponds to the finite automaton with 
one counter depicted in Fig. ^ The predicates correspond to the states of the automaton and the 
clauses correspond to the transitions. The predicate newl corresponds to the initial state, because the 
program is intended to be used for goals of the form match-pos s (S, N), where S is bound to a list of 
characters, and by clause 1 match-pos s {S,N) calls newl(S, N). Notice that this finite automaton is 
deterministic except for the state corresponding to the predicate newA, where the automaton can either 
(i) accept the input string by returning the value of N and moving to the final state true, even if the 
input string has not been completely scanned (see clause 49), or (ii) move to the state corresponding 
to new2, if the symbol of the input string which is scanned is a (see clause 55), or (iii) move to the 
state corresponding to newl, if the symbol of the input string which is scanned is different from a (see 
clause 56). 

7.2 Multiple Pattern Matching 

Given a list Ps of patterns and a string S we want to compute the position, say N, of any occurrence 
in S of a pattern which is a member of the list Ps. For any given Ps and S the following program 
computes N in a nondeterministic way: 

Program Mmatch (initial, nondeterministic) 

1. mmatch([P\Ps], S, N) <- match.pos{P, S, N) 

2. mmatch([P\Ps\,S,N) <- mmatch(Ps, S, N) 

The atom mmatch{Ps,S,N) holds iff there exists a pattern in the list Ps of patterns which occurs 
in the string S at position N. The predicate match jpos is defined as in program Match-Pos of 
Section 17. 11 and its clauses are not listed here. We consider the following mode for the program 
Mmatch: {mmatch(+, +, ?), match -pos {+,+,!), append (?,?,+), length(+, ?)}. 

We want to specialize this multi-pattern matching program w.r.t. the list [[a, a, a], [a, a, b]] of 
patterns. Thus, we introduce the following definition clause: 

3. mmatch s {S,N) <— mmatch([[a, a, a], [a, a,b]], S, N) 
The mode of the new predicate is mmatch s (+, ?) because S is an input argument of mmatch and iV is 
not an input argument. Thus, our Determinization Strategy starts off with the following initial values: 
Defs = Cls = {3}, TransfP = Mmatch, and M s = M U {mmatch s (+, ?)}. 
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The output of the Determinization Strategy is the following program Mmatch s : 



Program Mmatch s (specialized, semideterministic) 

4. mmatch s (5 ', N) <— newl(S, N) 

5. newl([a\S], M) <- new2(S,M) 

6. ncu>l([C|5],a(J\0)<-C7^o, newl(S,N) 

7. neu>2([o|5],Af) <- neto3(S,M) 

8. new2([C\S],s(s(N))) «- C^a, newl(S,N) 

9. new3([a\S], M) <- newA(S,M) 

10. neu>3([6|S], M) <- net»5(5, M) 

11. new3([C|5],s(s(s(iV)))) <- C^a, C^6, newl(S,N) 

12. new4(5',0)^ 

13. netfl4([a|S], s(iV)) «- new4(5, iV) 

14. neto4([6|5], s(N)) <- new5(S, N) 

15. newA([C\S],s(s(s(s(N))))) ^C^a, C^b, newl(S,N) 

16. new5(S, 0) <— 

17. new5([a|5], s(s(s(N)))) «- new2(S, TV) 

18. new5([C|5],s(s(s(s(iV))))) <- C/a, 'newl(S,N) 

Similarly to the single-pattern string matching example of the previous Section l7.1( this specialized, 
semideterministic program corresponds to a finite automaton with counters. This finite automaton 
is deterministic, except for the states corresponding to the predicates newA and new5 where any 
remaining portion of the input word is accepted. A similar derivation cannot be performed by usual 
partial deduction techniques without a prior transformation into failure continuation passing style |44| . 

7.3 Prom Regular Expressions to Finite Automata 

In this example we show the derivation of a deterministic finite automaton by specializing a general 
parser for regular expressions w.r.t. a given regular expression. The initial program Reg_Expr for 
testing whether or not a string belongs to the language denoted by a regular expression over the 
alphabet {a, b}, is the one given below. 



Program RegSxpr 


(initial, nondeterministic) 


1. in Janguag e (E, S) <— string (S), accepts (E,S) 




2. string([ ]) <— 




3. string{[a\S\) <— string(S) 




4. string([b\S]) <— string(S) 




5. accepts(E, [E]) <— symbol(E) 




6. accepts{E\E2, S) <— append(Si, S2, S), accepts (E\, S\) 


accepts (E2, S2) 


7. accepts {E1+E2, S) <— accepts (Ei, S) 




8. accepts {E1+E2, S) <— accepts (E2 , S) 




9. accepts [E*,[]) 




10. accepts (E* , S) <— ne _append (S 1, S2, S), accepts (E , Si) 


accepts (E* , S2) 


11. symbol(a) <— 




12. symbol{b) <— 




13. ne.append([A],Y, [A\Y]) <- 




14. ne. append ([A\X], Y, [A\Z\) <- ne .append (X, Y, Z) 





We have that inJanguage(E, S) holds iff S is a string in {a, b}* and S belongs to the language denoted 
by the regular expression E. In this Reg.Expr program we have used the predicate ne_append(Si, S2, S) 
which holds iff the non-empty string S is the concatenation of the nonempty string Si and the string 
5*2. The use of the atom ne-append(S\, S2, S) in clause 10 ensures that we have a terminating program, 
that is, a program for which we cannot have an infinite derivation when starting from a ground goal. 
Indeed, if in clause 10 we replace ne -append {S 1, S2, S) by append (S '1, S2, S), then we may construct 
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an infinite derivation because from a goal of the form accepts (E* , S) we can derive a new goal of the 
form (accepts(E, [ ]), accepts (E* , S)) . 

We consider the following mode for the program RegSxpr: 
{inJanguage(+,+), string{+), accepts(+,+), symbol{+), ne -append (?,?,+), append (?,?,+)}. 

We use our Determinization Strategy to specialize the program RegSxpr w.r.t. the atom 
inJanguage((aa* (b+bb))* , S). Thus, we begin by introducing the definition clause: 

15. inJanguage s (S) <— inJanguage((aa*(b+bb))*,S) 
The mode for this new predicate is inJanguage s (+) because S is an input argument of inJanguage. 
The output of the Determinization Strategy is the following specialized program Reg_Expr s : 



Pro 


gram RegSxpr s 


(specialized, semideterministic) 


16. 


inJanguage s (S) <— newl(S) 




17. 


newl{[ ]) <— 






18. 


newl([a \S]) <- 


- new2(S) 




19. 


new 2 ( [a \ S]) <- 


- new3(S) 




20. 


new2([b\S]) <- 


- new4(S) 




21. 


new3([a 5]) <- 


- new3(S) 




22. 


new3([6|5']) <- 


- newA(S) 




23. 


neu>4([ ]) <— 






24. 


newA([a\S]) *■ 


- new2(S) 




25. 


new4([b\S)) *- 


- newl(S) 





This specialized program corresponds to a deterministic finite automaton. 



7.4 Matching Regular Expressions 

The following nondeterministic program defines a relation re-match(E, S), where E is a regular ex- 
pression and S is a string, which holds iff there exists a substring P of S such that P belongs to the 
language denoted by E: 

Program RegSxpr_Match (initial, nondeterministic) 

1. re -.match (E, S) <— append (Y, R, S), append(L, P,Y), accepts(E,P) 

The predicates append and accepts are defined as in the programs Naive-Match (see Section 13. 3 1) and 
Reg-Expr (see Section ing , respectively, and their clauses are not listed here. We consider the following 
mode for the program Reg -Expr -Match: {append (?,?,+), accept(+,+), re -match (+,+)}• 

We want to specialize the program RegSxpr-Match w.r.t. the regular expression aa*b. Thus, we 
introduce the following definition clause: 

2. re-match s (S) <— re -match (aa*b, S) 
The mode of this new predicate is re-match s {+) because S is an input argument of re-match. The 
output of the Determinization Strategy is the following program: 

Program Reg Sxpr -Match s (specialized, semideterministic) 



3. 


re-match s (S) 


e— newl(S) 


4. 


newl([a\S}) <- 


- new2(S) 


5. 


newl([C\S}) <- 


- C^a, newl(S) 


6. 


new2([a\S]) <- 


- new3(S) 


7. 


new2{[C\S)) <- 


- C^a, newl(S) 


8. 


new3([a \S]) <- 


- new4(S) 


9. 


new3([b\S]) <- 


new3(S) 


10. 


new3([C\S}) <- 


- C^a, C^b, newl(S) 


11. 


newA(S) <— 





Similarly to the single-pattern string matching example of Section f3. 31 this specialized, semidetermin- 
istic program corresponds to a deterministic finite automaton. 
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7.5 Specializing Context-free Parsers to Regular Grammars 

Let us consider the following program for parsing context-free languages: 



Program CF_Parser 

1. string _parse(G, A, W) <— string (W), parse(G, A,W) 

2. string ([ ]) <— 

3. string ([0 1 W}) <— string (W) 

4. string ([1\W]) <— string (W) 

5. parse(G, [],[]) <- 

6. parse(G, [A\X], [A\Y]) «- terminal(A), parse(G,X,Y) 

7. parse(G,[A\X],Y) <— nonterminal (A), member (A — > B,G), 

append (B , X, Z), parse(G,Z,Y) 

8. mem6er(,4, L4|X]) <- 

9. member (A, [B\X]) <— member(A,X) 



(initial, nondeterministic) 



together with the clauses for the predicate append defined as in program Match-Pos (see Section 17.1(1 , 
and the unit clauses stating that and 1 are terminals and s,u,v, and n; are nonterminals. The first 
argument of parse is a context-free grammar, the second argument is a list of terminal and nonterminal 
symbols, and the third argument is a word represented as a list of terminal symbols. We assume that a 
context-free grammar is represented as a list of productions of the form x — > y, where x is a nonterminal 
symbol and y is a list of terminal and nonterminal symbols. We have that parse(G, [s], W) holds iff from 
the symbol s we can derive the word W using the grammar G. We consider the following mode for the 
program CF_Parser: { string _parse(+, +, +), string (+), parse (+, +, +), terminal{+), nonterminal (+), 
member(7, +), append(+, +, ?)}. 

We want to specialize our parsing program w.r.t. the following regular grammar: 



s - 
u 

V 

w 
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1 
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w 
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To this aim we apply our Determinization Strategy starting from the following definition clause: 



W) 



10. string -parse s (W) <— parse{[ s — ► [0, n], 


s - 




s - 


+ [0,w] 


u-[0], ' 


u - 


- [0,n], 


u - 


-> M 


v^[0], 


v - 


- M, 


v - 


- [0,n] 




w - 


- [o,«>] 






The mode for this new predicate is string .parse. 


(+)■ 


The output 


of the 



the following specialized program CF_Parser s : 



Program CF _Parser s 



(specialized, semideterministic) 



11. 


string _parse s (W) <— neu>l 


12. 


raeiul([0|W]) «- 


- new2(W) 


13. 


neto2([0|W]) «- 


- new3(W) 


14. 


neu>2([l|W]) <- 


- new4(W) 


15. 


neu>3([ ]) <— 




16. 


ne«>3([0|W]) <- 


- new5(W) 


17. 


netfl3([l|W]) *- 


- newitJV) 


18. 


newA{[ ]) <— 




19. 


new5([ ]) <— 




20. 


new5([0|T^]) <- 


- new3(W) 


21. 


neti>5([l|W]) *- 


- newi(W) 



This program corresponds to a deterministic finite automaton. 

Now, we would like to discuss the improvements we achieved in this example by applying our 
Determinization Strategy. Let us consider the derivation tree T\ (see Fig. I^J) generated by the initial 
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string _parse(g , [s], [0™1]) 



string{[O n l}),parse(g, [s], [0™1]) 



parse(g,[s], [0 n l]) 




parse(g, [w], [0™ 



parse(g, [w], [0™ 2 1]) 



true 



no successes 



Figure 2: Derivation tree T\ for string _parse(g, [s], [0"T]). 



program CF_Parser starting from the goal string -parse(g, [s], [0 n l]), where g denotes the grammar 
w.r.t. which we have specialized the CF_Parser program and [0 n l] denotes the list [0, . . . , 0, 1] with n 
occurrences of 0. The nodes of T\ are labeled by the goals derived from string -parse(g, [s], [0 n l]). In 
particular, the root of the derivation tree is labeled by string -parse(g, [s], [0 n l]) and a node labeled by 
a goal G has k children labeled by the goals G±, . . . , which are derived from G (see Section l2~3)) . The 
tree T\ has a number of nodes which is 0(2 n ). Thus, by using the initial program CF_Parser it takes 
0(2 n ) number of steps to search for a derivation from the root goal string -parse(g, [s], [0 n l]) to the 
goal true. (Indeed, this is the case if one uses a Prolog compiler.) In contrast, by using the specialized 
program CF_Parser s , it takes 0(n) steps to search for a derivation from the goal string _parse s ([0 n l]) 
to true, because the derivation tree T2 has a number of nodes which is 0(n) (see Fig. 0J). 

The improvement of performance is due to the fact that our Determinization Strategy is able to 
avoid repeated derivations by introducing new definition clauses whose bodies have goals from which 
common subgoals are derived. Thus, after performing folding steps which use these definition clauses, 
we reduce the search space during program execution. 

For instance, our strategy introduces the predicate neufl defined by the following clauses: 

new2(W) <— string(W), parse(g, [u],W) 
new2(W) <— string(W), parse(g, [v],W) 
new2(W) <— string(W), parse(g, [w],W) 

whose bodies are goals from which common subgoals are derived for W = [0 n_1 l] and n > 2. Indeed, for 
instance, parse(g, [u], [0 n-2 l]) can be derived from both parse(g, [u], [0 n-1 l]) and parse(g, [v], [0 n-1 l]) 
(see Fig. |2j). The reader may verify that by using the specialized program CF ' -Parser s no repeated 
goal is derived from string _parse s (g, [s], [0 n l]). 

The ability of our Determinization Strategy of putting together the computations performed by 
the initial program in different branches of the computation tree, so that common repeated subcom- 
putations are avoided, is based on the ideas which motivate the tupling strategy |34j . first proposed 
as a transformation technique for functional languages. 

8 Experimental Evaluation 

The Determinization Strategy has been implemented in the MAP program transformation system |39| . 
All program specialization examples presented in Sections 13.31 15.31 and have been worked out in 
a fully automatic way by the MAP system. We have compared the specialization times and the 
speedups obtained by the MAP system with those obtained by ECCE, a system for (conjunctive) 
partial deduction j^l]. All experimental results reported in this section have been obtained by using 
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string -parse s (g, [s] , [O n 1] ) (n > 2) 
newl([O n l]) 
new2([0 n - 1 l]) 
new3([0 n - 2 l]) 

true 

Figure 3: Derivation tree T<i for string -parse s ([O n l]). 

SICStus Prolog 3.8.5 running on a Pentium II under Linux. 

In Table 1 we consider the examples of Sections 13.31 15.31 and [71 and we show the times taken 
(i) for performing partial deduction by using the ECCE system, (ii) for performing conjunctive partial 
deduction by using the ECCE system, and (hi) for applying the Determinization Strategy by using 
the MAP system. The static input shown in Column 2 of Table 1 is the goal w.r.t. which we have 
specialized the programs of Column 1. For running the ECCE system suitable choices among the 
available unfolding strategies and generalization strategies should be made. We have used the choices 
suggested by the system itself for partial deduction and conjunctive partial deduction, and we made 
some changes only when specialization was not performed within a reasonable amount of time. For 
running the MAP system the only information to be provided by the user is the mode for the program 
to be specialized. The system assumes that the program satisfies this mode and no mode analysis is 
performed. 



Program 


Static Input 


ECCE 


ECCE 


MAP 






(PD) 


(CPD) 


(Det) 


Naive-Match 


naive -match{[aab], S) 


360 


370 


70 


Naive-Match 


naive-match([aaaaaaaaab], S) 


420 


2120 


480 


Match-Pos 


match jpos i\aab], S, N) 


540 


360 


100 


Match-Pos 


match-pos ( [aaaaaaaaab] ,S,N) 


650 


910 


500 


Mmatch 


mmatch ([[aaa], [aab]], S,N) 


1150 


1400 


280 


Mmatch 


mmatch([[aa], [aaa], [aab]], S, N) 


1740 


2040 


220 


RegSxpr 


inJanguage((aa* (b+bb))* , S) 


6260 


138900 


420 


RegSxpr 


inJanguage(a*(b+bb+bbb), S) 


3460 


5430 


230 


Reg -Expr -Match 


re -match (aa*b, S) 


970 


5290 


210 


Reg -Expr -Match 


re -match (a* (b + bb), S) 


1970 


11200 


300 


CF -Parser 


string -parse(g, [s], W) 


23400 


32700 


1620 


CF -Parser 


string -pars e(g\, [s], W) 


31200 


31800 


2000 



Table 1: Specialization Times (in milliseconds). 

The experimental results of Table 1 show that the MAP implementation of the Determinization 
Strategy is much faster than the ECCE implementation of both partial deduction and conjunctive 
partial deduction. We believe that, essentially, this is due to the fact that ECCE employs very sophis- 
ticated techniques, such as those based on homeomorphic embeddings, for controlling the unfolding 
and the generalization steps, and ensuring the termination of the specialization process. For a fair 
comparison, however, we should recall that Determinization may not terminate on examples different 
from those considered in this paper. 

We have already mentioned in Section 13.31 that the performance of the programs derived by the 
Determinization Strategy may be further improved by applying post-processing transformations which 
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exploit the semideterminism of the programs. In particular, we may: (i) reorder the clauses so that unit 
clauses appear before non-unit clauses, and (ii) remove disequations by introducing cuts instead. The 
reader may verify that these transformations preserve the operational semantics. For a systematic 
treatment of cut introduction, the reader may refer to |1( )1 I43| . As an example we now show the 
program obtained from Match-Pos s (see Section I5.3|) after the above post-processing transformations 
have been performed. 



Program Match-Pos cu t 


(specialized, with cuts) 


match-pos s (S,N) <— newl(S, N) 




newl{[a\S], M) <- !, new2(S,M) 




newl([C\S],s(N)) <- newl(S,N) 




new2([a\S],M) <- !, new3(S,M) 




new2([C[S],s(s(N))) «- newl{S,N) 




new3{[a\S], s(M)) «- !, new3(R, S) 




new3{[b\S],M) «- !, newA{R,S) 




new3([C\S],s(s(s(N)))) <- newl(S,N) 




new4(S,0) <- 




newA([a\S],s(s(s(M)))) <- !, new2(S,M) 




newA{[C\S], s(s(s(s(N))))) «- newl(S,N) 





In Table 2 below we report the speedups obtained by partial deduction, conjunctive partial deduction, 
Determinization, and Determinization followed by disequation removal and cut introduction. Every 
speedup is computed as the ratio between the timing of the initial program and the timing of the 
specialized program. These timings were obtained by running the various programs several times (up 
to 10,000) on significantly large input lists (up to 4,000 items). 



Program 


Static Input 


Speedup 


Speedup 


Speedup 


Speedup 






(PD) 


(CPD) 


(Det) 


(Det & Cut) 


Naive-Match 


naive -match ([aab], S) 


3.1 


5.8 xlO 3 


3.0 xlO 3 


6.8 xlO 3 


Naive-Match 


naive -match ([aaaaaaaaab], S) 


3.3 


6.9 xlO 3 


5.8 xlO 3 


12.4 xlO 3 


Match-Pos 


match jpos ([oab], S, N) 


1.6 


3.6 xlO 3 


1.8 xlO 3 


4.0 xlO 3 


Match-Pos 


match _pos([aaaaaaaaab], S, N) 


2.1 


5.3 xlO 3 


2.9 xlO 3 


8.1 xlO 3 


Mmatch 


mmatch ([[aaa], [aab]], S,N) 


1.7 


4.5 xlO 3 


3.5 xlO 3 


6.2 xlO 3 


Mmatch 


mmatch([[aa], [aaa], [aab]], S, N) 


1.6 


2.5 xlO 3 


3.9 xlO 3 


5.4 xlO 3 


RegSxpr 


inJanguage((aa*(b+bb))*, S) 


29.8 


6.2 xlO 3 


2.3 xlO 5 


3.9 xlO 5 


RegSxpr 


inJanguage(a*(b+bb+bbb), S) 


1.3 xlO 4 


3.3 xlO 4 


4.6 xlO 4 


5.7 xlO 4 


Reg -Expr -Match 


re -match (aa*b, S) 


5.7xl0 2 


2.7xl0 4 


1.5 xlO 5 


3.0 xlO 6 


Reg -Expr -Match 


re _match (a* (b + bb), S) 


2.1 xlO' 2 


3.4 xlO 3 


2.5 xlO 5 


4.1 xlO 5 


CF -Parser 


string _parse(g, [s], W) 


1.5 


1.5 


87.1 


87.1 


CF -Parser 


string -parse (gi, [s], W) 


1.1 


1.1 


61.3 


61.3 



Table 2: Speedups. 
To clarify the content of Table 2 let us remark that: 

Column 1 shows the names of the initial programs with reference to Sections I3.3| 15.31 and 

Column 2 shows the static input. The argument [aab] denotes the list [a, a, b]. Similar notation 
has been used for the other static input arguments. The argument g of the first stringjparse atom 
denotes the regular grammar considered in Example 17.51 The argument 51 of the last stringjparse 
atom denotes the regular grammar: 

{s — >0u, s— >lv, u — > 0, u—>Qv, u—>Qw, v — > 1, v^Ov, v— >lu, w —*■ 1, w — > lw}. 

Column 3, called Speedup (PD), shows the speedups we have obtained after the application of partial 
deduction. 
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Column 4, called Speedup (CPD), shows the speedups we have obtained after the application of 
conjunctive partial deduction. 

Column 5, called Speedup (Det), shows the speedups we have obtained after the application of the 
Determinization Strategy. 

Column 6, called Speedup (Det & Cut), shows the speedups we have obtained after the application of 
the Determinization Strategy followed by the removal of disequations and the introduction of cuts. 

Let us now discuss our experimental results of Table 2. In all examples the best speedups are those 
obtained after the application of the Determinization Strategy followed by the removal of disequations 
and the introduction of cuts (see column Det & Cut). 

As expected, conjunctive partial deduction gives higher speedups than partial deduction. 

In some cases, conjunctive partial deduction gives better results than Determinization (see the first 
5 rows of columns CPD and Det). This happens in examples where most nondeterminism is avoided by 
eliminating intermediate lists (see, for instance, the example of Section IM.HJI . In those examples, in fact, 
the Determinization Strategy may be less advantageous than conjunctive partial deduction because it 
introduces disequations which may be costly to check at runtime. However, as already mentioned, all 
disequations may be eliminated by introducing cuts (or, equivalently, if-then-else constructs) and the 
programs derived after disequation removal and cut introduction are indeed more efficient than those 
derived by conjunctive partial deduction (see column Det & Cut). 

For some programs (see, for instance, the entries for RegSxpr and CF_Parser) the speedups of 
the (Det) column are equal to the speedups of the (Det & Cut) column. The reason for this fact is the 
absence of disequations in the specialized program, so that the introduction of cuts does not improve 
efficiency. 

We would like to notice that further post-processing techniques are applicable. For instance, sim- 
ilarly to the familiar case of finite automata, we may eliminate clauses corresponding to e-transitions 
where no input symbols are consumed (such as clause 9 in program Match_Pos s ), and we may also 
minimize the number of predicate symbols (this corresponds to the minimization of the number of 
states). We do not present here these post-processing techniques because they are outside the scope 
of the paper. 

In summary, the experimental results of Table 2 confirm that in the examples we have considered, 
the Determinization Strategy followed by the removal of disequations in favour of cuts, achieves 
greater speedups than (conjunctive) partial deduction. However, it should be noticed that, as already 
mentioned, Determinization does not guarantee termination, while (conjunctive) partial deduction 
does, and in order to terminate in all cases, (conjunctive) partial deduction employs generalization 
techniques that may reduce speedups. In the next section we further discuss the issue of devising a 
generalization technique that ensures the termination of the Determinization Strategy. 

9 Concluding Remarks and Related Work 

We have proposed a specialization technique for logic programs based on an automatic strategy, called 
Determinization Strategy, which makes use of the following transformation rules: (1) definition intro- 
duction, (2) definition elimination, (3) unfolding, (4) folding, (5) subsumption, (6) head generalization, 
(7) case split, (8) equation elimination, and (9) disequation replacement. (Actually, we make use of 
the safe versions of the rules 4, 6, 7, and 8.) We have also shown that our strategy may reduce the 
amount of nondeterminism in the specialized programs and it may achieve exponential gains in time 
complexity. 

To get these results, we allow new predicates to be introduced by one or more non-recursive 
definition clauses whose bodies may contain more than one atom. We also allow folding steps using 
these definition clauses. By a folding step several clauses are replaced by a single clause, thereby 
reducing nondeterminism. 

The use of the subsumption rule is motivated by the desire of increasing efficiency by avoiding 
redundant computations. Head generalizations are used for deriving clauses with equal heads and 
thus, they allow us to perform folding steps. The case split rule is very important for reducing 



40 



nondeterminism because it replaces a clause, say C, by several clauses which correspond to exhaustive 
and mutually exclusive instantiations of the head of C. To get exhaustiveness and mutual exclusion, 
we allow the introduction of disequalities. To further increase program efficiency, in a post-processing 
phase these disequalities may be removed in favour of cuts. 

We assume that the initial program to be specialized is associated with a mode of use for its 
predicates. Our Determinization Strategy makes use of this mode information for directing the various 
transformation steps, and in particular, the applications of the unfolding and case split rules. Moreover, 
if our strategy terminates, it derives specialized programs which are semideterministic w.r.t. the given 
mode. This notion has been formally defined in Section 15.31 Although semideterminism is not in 
itself a guarantee for efficiency improvement, it is often the case that efficiency is increased because 
nondeterminism is reduced and redundant computations are avoided. 

We have shown that the transformation rules we use for program specialization, are correct w.r.t. 
the declarative semantics of logic programs based on the least Herbrand model. The proof of this 
correctness result is similar to the proofs of the correctness results which are presented in 1 141 140[ I46j . 

We have also considered an operational semantics for our logic language where a disequation t\^ti 
holds iff t\ and £2 are not unifiable. This operational semantics is sound, but not complete w.r.t. the 
declarative semantics. Indeed, if a goal operationally succeeds in a program, then it is true in the 
least Herbrand model of the program, but not vice versa. Thus, the proof of correctness of our 
transformation rules w.r.t. the operational semantics cannot be based on previous results and it is 
much more elaborate. Indeed, it requires some restrictions, related to the modes of the predicates, 
both on the programs to be specialized and on the applicability of the transformation rules. 

In Section [3] we have extensively discussed the fact that our specialization technique is more 
powerful than partial deduction |21[ 129 j. The main reason of the greater power of our technique is 
that it uses more powerful transformation rules. In particular, partial deduction corresponds to the 
use the definition introduction, definition elimination, unfolding, and folding transformation rules, 
with the restriction that we may only fold a single atom at a time in the body of a clause. 

Our extended rules allow us to introduce and transform new predicates defined in terms of dis- 
junctions of conjunctions of atoms (recall that a set of clauses with the same head is equivalent to 
a single clause whose premise is the disjunction of the bodies of the clauses in the given set). In 
this respect, our technique improves over conjunctive partial deduction 8 , which is a specialization 
technique where new predicates are defined in terms of conjunctions of atoms. 

We have implemented the Determinization Strategy in the MAP transformation system "22] and we 
have tested this implementation by performing several specializations of string matching and parsing 
programs. We have also compared the results obtained by using the MAP system with those obtained 
by using the ECCE system for (conjunctive) partial deduction Our computer experiments con- 
firm that the Determinization Strategy pays off w.r.t. both partial deduction and conjunctive partial 
deduction. 

Our transformation technique works for programs where the only negative literals which are allowed 
in the body of a clause, are disequations between terms. The extension of the Determinization Strategy 
to normal logic programs would require an extension of the transformation rules and, in particular, 
it would be necessary to use a negative unfolding rule, that is, a rule for unfolding a clause w.r.t. 
a (possibly nonground) negative literal different from a disequation. The correctness of unfold/fold 
transformation systems which use the negative unfolding rule has been studied in contexts rather 
different from the one considered here (see, for instance, the work on transformation of first order 
programs j32j) and its use within the Determinization Strategy requires further work. 

The Determinization Strategy may fail to terminate for two reasons: (i) the Unfold-Simplify sub- 
sidiary strategy may apply the unfolding rule infinitely often, and (ii) the while-do loop of the Deter- 
minization Strategy may not terminate, because at each iteration the Define-Fold subsidiary strategy 
may introduce new predicates. 

The termination of the Unfold-Simplify strategy can be guaranteed by applying the techniques for 
finite unfolding already developed for (conjunctive) partial deduction (see, for instance, [51 12*31 [30 ) . 
Indeed, the unfolding rule used in this paper is similar to the unfolding rule used in partial deduction. 

The introduction of an infinite number of new predicates can be avoided by extending various 
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methods based on generalization, such as those used in (conjunctive) partial deduction [HI I13| I25[ 
37.. Recall that in conjunctive partial deduction we may generalize a predicate definition essentially 
by means of two techniques: (i) the replacement of a term by a variable, which is then taken as 
an argument of a new predicate definition, and (ii) the splitting of a conjunction of literals into 
sub conjunctions (together with the introduction of a new predicate for each subconjunction). It has 
been shown that the use of (i) and (ii) in a suitably controlled way, allows conjunctive partial deduction 
to terminate in all cases. However, termination is guaranteed at the expense of a possibly incomplete 
specialization or a possibly incomplete elimination of the intermediate data structures. 

In order to avoid the introduction of an infinite number of new predicate definitions while applying 
the Determinization Strategy, we may follow an approach similar to the one used in the case of 
conjunctive partial deduction. However, besides the generalization techniques (i) and (ii) mentioned 
above, we may also need (iii) the splitting of the set of clauses defining a predicate into subsets (together 
with the introduction of a new predicate for each subset). Similarly to the case of conjunctive partial 
deduction, it can be shown that suitably controlled applications of the generalization techniques (i), 
(ii), and (iii) guarantee the termination of the Determinization Strategy at the expense of deriving 
programs which may fail to be semideterministic. 

We leave it for further research the issue of controlling generalization, so that we achieve the 
termination of the specialization process and at the same time we maximize the reduction of nonde- 
terminism. 

In the string matching examples we have worked out, our strategy is able to automatically derive 
programs which behave like Knuth-Morris-Pratt algorithm, in the sense that they generate a finite 
automaton from any given pattern and a general pattern matcher. This was done also in the case of 
programs for matching sets of patterns and programs for matching regular expressions. 

In these examples the improvement over similar derivations performed by partial deduction tech- 
niques I13| 144] consists in the fact that we have started from naive, nondeterministic initial pro- 
grams, while the corresponding derivations by partial deduction described in the literature, use initial 
programs which are deterministic. Our derivations also improve over the derivations performed by 
using supercompilation with perfect driving |15| I47j and generalized partial computation which 
start from initial functional programs which already incorporate some ingenuity. 

A formal derivation of the Knuth-Morris-Pratt algorithm for pattern matching has also been pre- 
sented in [3] . This derivation follows the calculational approach which consists in applying equivalences 
of higher order functions. On the one hand the calculational derivation is more general than ours, 
because it takes into consideration a generic pattern, not a fixed one (the string [a, a, b] in our Exam- 
ple !3.3j) . on the other hand the calculational derivation is more specific than ours, because it deals with 
single-pattern string matching only, whereas our strategy is able to automatically derive programs in 
a much larger class which also includes multi-pattern matching, matching with regular expressions, 
and parsing. 

The use of the case split rule is a form of reasoning by cases, which is a very well-known technique 
in mechanical theorem proving (see, for instance, the Edinburgh LCF theorem prover JZj). Forms of 
reasoning by cases have been incorporated in program specialization techniques such as the already 
mentioned supercompilation with perfect driving |15| I47j and generalized partial computation |12| . 
However, the strategy presented in this paper is the first fully automatic transformation technique 
which uses case reasoning to reduce nondeterminism of logic programs. 

Besides specializing programs and reducing nondeterminism, our strategy is able to eliminate 
intermediate data structures. Indeed, the initial programs of our examples in Section all have 
intermediate lists, while the specialized programs do not have them. Thus, our strategy can be 
regarded as an extension of the transformation strategies for the elimination of intermediate data 
structures (see the deforestation technique [IS] f° r the case of functional programs and the strategy 
for eliminating unnecessary variables jHH] for the case of logic programs). Moreover, our strategy 
derives specialized programs which avoid repeated subcomputations (see the Context-free Parsing 
example of Section l7.5|) . In this respect our strategy is similar to the tupling strategy for functional 
programs [HI] . 

Finally, our specialization strategy is related to the program derivation techniques called finite 
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differencing |33| and incrementalization [22j. These techniques use program invariants to avoid costly, 
repeated calculations of function calls. Our specialization strategy implicitly discovers and exploits 
program invariants when using the folding rule. It should be noticed, however, that it is difficult to 
establish in a rigorous way the formal connection between the basic ideas underlying our specialization 
strategy and the above mentioned program derivation methods based on program invariants. These 
methods, in fact, are presented in a very different framework. 
This paper is an improved version of |35j . 
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Appendix A. Proof of Theorem [f3 



For the reader's convenience, we rewrite the statement of Theorem |BJ 

Theorem |S] (Correctness of the Rules w.r.t. the Operational Semantics) Let Pq, ... ,P n be 

a transformation sequence constructed by using the transformation rules EEl and let p be a non-basic 
predicate in P n . Let M be a mode for Pq U Defs n such that: (i) Pq U Defs n is safe w.r.t. M, (ii) 
Pq U Defs n satisfies M, and (iii) the applications of the unfolding, folding, head generalization, and 
case split rules during the construction of Pq, . . . , P n are all safe w.r.t. M. Suppose also that: 

1. if the folding rule is applied for the derivation of a clause C in program Pk+i from clauses 
Ci, . . . , C m in program Pk using clauses D\, . . . , D m in Defsk, with 0<fc<n, 

i/ien for every i £ {1, . . . , m} there exists j £ {1, . . . , n— 1} such that Di occurs in Pj and Pj+i 
is derived from Pj by unfolding Dj. 

2. during the transformation sequence Po,...,P n the definition elimination rule either is never 
applied or it is applied w.r.t. predicate p once only, when deriving P n from P n -\. 

Then: (i) P n is safe w.r.t. M, (ii) P n satisfies M, and (iii) for each atom A which has predicate p and 
satisfies mode M, A succeeds in Pq U Defs n iff A succeeds in P n . 

The proof of Theorem will be divided in four parts, corresponding to Propositions 01 IU El and|Bl 
presented below. 

Proposition |3] {Preservation of Safety) shows that program P n derived according to the hypotheses 
of Theorem is safe w.r.t. mode M (that is, Point (i) of the thesis of Theorem EJ). Proposition 
(Preservation of Modes) shows that P n satisfies M (that is, Point (ii) of the thesis of Theorem EJ). 
Propositions El (Partial Correctness) and H3 ( Completeness) show the if part and the only-if part, 
respectively, of Point (iii) of the thesis of Theorem H3 For proving these propositions we will use 
various notions and lemmata which we introduce below. 

Al. Preservation of Safety 

In this section we prove that, if the transformation rules are applied according to the restrictions 
indicated in Theorem then from a program which is safe w.r.t. a given mode we derive a program 
which is safe w.r.t. the same mode. 

Proposition 3 (Preservation of Safety) Let Pq, . . . , P n be a transformation sequence constructed 
by using the transformation rules EE1 Let M be a mode for Pq U Defs n such that: (i) Pq U Defs n 
is safe w.r.t. M and (ii) the applications of the unfolding, head generalization, and case split rules 
during the construction of Pq, . . . , P n are safe w.r.t. M. Then, for k = 0, . . . , n, the program P^ is 
safe w.r.t. M. 

Proof: The proof proceeds by induction on k. During the proof we will omit the reference to mode 
M. In particular, we will simply say that a program (or a clause) is safe, instead of saying that a 
program (or a clause) is safe w.r.t. M. 

For k = the thesis follows directly from the hypothesis that Pq U Defs n is safe and thus, Pq is 
safe. Let us now assume that, for k < n, program P^ is safe. We will show that also Pk+i is safe. We 
consider the following cases, corresponding to the rule which is applied to derive Pk+i from P^. 

Case 1: Pk+i is derived by applying the definition introduction rule. Pk+i is safe because Pk is safe 
and, by hypothesis, every definition clause in Defs n is safe. 

Case 2: Pk+i is derived by applying the definition elimination rule. Then Pk+i is safe because Pk is 
safe and Pk+i Q Pk- 

Case 3: Pk+i is derived by a safe application of the unfolding rule (see Definition |1J) . Let us consider 
a clause Di in Pk+i which has been derived by unfolding a clause C in Pk of the form: H <— 
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G\, A, G2 w.r.t. the atom A. Then there exists a clause C% in Pk such that (i) A is unifiable with 
hd(Ci) via the mgu and (ii) clause Di in Pk+i of the form {H <— G%, bd(Ci), Gg)^- 

Let us now show that Di is safe. We take a variable X occurring in a disequation t\ 7^ ti in 
the body of Di, and we prove that X is either an input variable of hd(Di) or a local variable 
of t\ 7^ t2 in Dj. We have that t\ 7^ £2 is of the form {u\ 7^ U2)'&i, where u\ 7^ U2 is a disequation 
occurring in G\, bd(Ci),G2- We consider two cases: 

Case A: u\ 7^ U2 occurs in G\ or G2. Since t\ 7^ t2 is of the form (u\ 7^ U2)$i, there exists a 
variable 1" € vars(u\ 7^2x2) such that X € yars(Y$). By the inductive hypothesis, C is safe and 
thus, Y is either an input variable of hd(C) or a local variable of u\ 7^2 in C. We have that: 
(i) if Y is an input variable of hd(C) then X is an input variable of hd(Di), and (ii) if Y is a 
local variable of u\ 7^2 in C then X = Y = Y$i and X is a local variable of ii 7^2 in Di. 

Case B: u\^U2 occurs in bd(Ci). From the definition of safe unfolding we have that X is either: 
(B.l) an input variable of H$i or (B.2) a local variable of u\ 7^2 in Cj. In case (B.l) X is an 
input variable of hd(Di), which is equal to H-di. In case (B.2) X does not occur in i?j and, since 
vars(C) n vars{Ci) = 0, X is a local variable of (ui^Uzj'&i, which is equal to t\^t2, in Di. 

Case 4: Pfc+i is derived by applying the folding rule. Let us consider a clause Pk+i of the form: 
C. H <— Gi, newp(Xi, . . . ,X h )tf, G 2 
which has been derived by folding the following clauses in P^: 



Now we take a variable X occurring in a disequation ii 7^2 in the body of C, and we prove that 
X is either an input variable of H or a local variable of £i 7^2 in C. 

The disequation £17^2 occurs in G\ or G2 and, by the hypothesis that Pk is safe, either X is an 
input variable of H or, for i = 1, . . . , m, X is a local variable of ii 7^2 in Gj. If for i = 1, . . . , m, 
X is a local variable of ti 7^2 in Gj, then X is a local variable of t\ 7^2 in G, because by the 
definition of the folding rule (see Rule0J X does not occur in newp(Xi, . . . , X^)$. 

Case 5: Pk+i is derived by applying the subsumption rule. Pk+i is safe because Pk+i C P k . 

Case 6: Pk+i is derived by a safe application of the head generalization rule (see Definition EJ) . Let 
GenC be a clause in Pk+i of the form: 

H <- Y = t,Body 

derived from a clause C in of the form: 
H{Y/t} <- Sody 

where {Y/i} is a substitution such that Y occurs in H and V does not occur in C. 

Let us now prove that GenC is safe. Let X be a variable occurring in a disequation t\ 7^ ti in 
Body. By inductive hypothesis C is safe and thus, X is either an input variable of H{Y/t} or a 
local variable of t\^ti in C. If X is an input variable of H{Y/t}, then it is also an input variable 
of H, because from the definition of safe head generalization it follows that H and H{Y/t} have 
the same input variables. If X is a local variable of t\ 7^ t-i in C, then X is a local variable of 
£17^2 in GenC, because X does not occur in Y = t. 




using the following definition clauses in Defs/.: 




TO 



45 



Case 7: Pk+x is derived by a safe application of the case split rule (see Definition EJ) to a clause C in 
Pfc. Let us consider the following two clauses in Pk+i- 

C x . (H*- Body){X/t] 
C 2 - H «- X^t,Body. 

derived by safe case split from C. Let us now show that C\ and C2 are safe. Let us consider 
clause C\ and let Y be a variable occurring in a disequation t\ 7^2 in Body{X/t]. t\ 7^2 is of 
the form {u\^U2){X/t} where u\^U2 occurs in Body. We consider two cases. 

Case A: Y 6 vars(t). By the definition of safe case split, either Y is an input variable of H or 
Y does not occur in C. If Y is an input variable of H, then Y is an input variable of H{X/t}, 
and if Y does not occur in C, then Y is a local variable of (u\ ^U2){X/t} in C\. 

Case B: Y vars(t). We have that Y occurs in u\^U2, and thus, from the inductive hypothesis 
that C is safe, it follows that Y is either an input variable of H or a local variable of u\^U2 in 
C. If Y is an input variable of H, then Y is either an input variable of H{X/t}, and if Y a local 
variable of u\^U2 in C, then it is a local variable of {u\ ^U2){X/t} in C\. 

Thus, C\ is a safe clause. 

Let us now consider clause C2 and let Y be a variable occurring in a disequation t\ 7^ £2 in 
I/t, Body. If ii 7^ £2 occurs in 5orfy then from the inductive hypothesis that C is safe, it 
follows that Y is either an input variable of if or a local variable of t\ 7^ £2 in C2. If t\ 7^ £2 
is A 7^ t, then by the definition of safe case split (i) X is an input variable of H, and (ii) for 
every variable Y £ vars(t), either (ii.l) Y is an input variable of H or (ii. 2) Y does not occur in 
H, Body, and thus, Y is a local variable of X^t in C2. 

Thus, C2 is a safe clause. 

Case 8: Pfc+l is derived by applying the equation elimination rule to a clause C\ in Pk of the form: 
H <— Gi, ti = £2, C2. We consider two cases: 

Case A: t\ and £2 are unifiable via the most general unifier •&. We derive the clause: C%. (H 
G%, G^) 1 !?. We can show that clause C2 is safe similarly to Case 3 (A). 

Case B: t\ and £2 are not unifiable. In this case Pk+i is safe because Pk+i is Pk — {C±} and, by 
inductive hypothesis all clauses in P^ are safe. 

Case 9: Pk+i is derived by applying the disequation replacement rule to clause C in Pk. Let us 
consider the cases 9.1-9.5 of Rule El Cases 9.1 and 9.3-9.5 are straightforward, because they 
consist in the deletion of a disequation in bd(C) or in the deletion of clause C. Thus, in these 
cases the safety of program Pk+\ derives directly from the safety of Pk- 

Let us now consider case 9.2. Suppose that clause C is of the form: H <— G±, f(t\, . . . ,t m ) 7^ 
f(ui, . . . , u m ), G2, and it is replaced by the following m (> 0) clauses: 

Ct. H^G 1 ,t 1 ^u 1 ,G 2 



Cm- H <— G±, t m J=U m , G2 

We now prove that, for j = 0, . . . , m, Cj is safe. Indeed, for j = . . . m, if we consider a variable 
X occurring in tj 7^ Uj then, by the inductive hypothesis, either (i) X is an input variable of H 
or (ii) AT is a local variable of f(t\, . . . , t m ) 7^ f(u\, . . . , u m ) in C, and thus, A is a local variable 
of tj 7^ Uj in Cj . 

In the case where X occurs in a disequation in G\ or G2, it follows directly from the inductive 
hypothesis that A is either an input variable of H or a local variable of that disequation in Cj . 

Thus, Cj is safe. □ 
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A2. Preservation of Modes 

Here we show that, if the program Po U Defs n satisfies a mode M and we apply our transformation 
rules according to the restrictions indicated in Theorem El then the derived program P n satisfies M. 

In this section and in the rest of the paper, we will use the following notation and terminology. Let 
us consider two non-basic atoms A\ and A 2 of the form p(t\, . . . , t m ) and p(u±, . . . , u m ), respectively. 
By A\ =A2 we denote the conjunction of equations: t\ =ui, . . . , t m = u m . By mgu(Ai, A2) we denote a 
relevant mgu of two unifiable non-basic atoms A\ and A 2 - Similarly, by mgu(t\,t 2 ) we denote a relevant 
mgu of two unifiable terms t\ and t 2 . The length of the derivation Go 1 — >p G\ 1 — >p . . . 1 — >p G n is 
n. Given a program P and a mode M for P, we say that a derivation Go 1 — >p G\ 1 — >p . . . 1 — >p G n 
is consistent with M iff for i = 0, . . . , n — 1, if the leftmost atom of Gj is a non-basic atom A then A 
satisfies M. 

The following properties of the operational semantics can be proved by induction on the length of 
the derivations. 

Lemma 1 Let P be a program and G\ a goal. If G\ succeeds in P with answer substitution then 
for all goals G 2 , (Gi,G 2 ) 1 — >* P G 2 tf. 

Lemma 2 Let P be a safe program w.r.t. mode M, let Eqs be a conjunction of equations, and let 
Gi be a goal without occurrences of disequations. For all goals G 2 , if there exists a goal (A', G') such 
that A' is a non-basic atom which does not satisfy M and 

(Egs,G 1 ,G 2 ) —J, (A',G') 

then there exists a goal (A", G") such that A" is a non-basic atom which does not satisfy M and 

(G 1 ,Eqs,G 2 ) —J, 

Lemma 3 Let Po, • • • > -Pn be a transformation sequence constructed by using the transformation rules 
EEl Let M be a mode for Po U Defs n such that: (i) Po, Uf e/s n is safe w.r.t. M, (ii) Po U Defs n satisfies 
M, and (hi) the applications of the unfolding, folding, head generalization, and case split rules during 
the construction of Po, . . . , P n are safe w.r.t. M. Then, for k = 0, . . . , n, for all goals G, if all derivations 
from G using Po U Defs n are consistent with M, then all derivations from G using P/% are consistent 
with M. 

Proof: By Proposition 01 we have that, for k = 0, ... ,n, the program P^ is safe w.r.t. M. 
The proof proceeds by induction on k. 

The base case (k = 0) follows from the fact that all derivations from G using Po are also derivations 
using P U Defs n . 

In order to prove the step case, we prove the following counterpositive statement: 

for all goals (Aq,Gq), if there exists a goal (A S ,G S ) such that (Aq,Go) i — > p fe+i (A S ,G S ) and (A S ,G S ) 
does not satisfy M, then there exists a goal (At,Gt) such that (^4o>Gd) i — >p (At,Gt) and At does 
not satisfy M. 

We proceed by induction on the length s of derivation of (A S ,G S ) from (Aq,Gq) using P^+i- As an 
inductive hypothesis we assume that, for all r < s and for all goals G, if there exists a derivation 
G 1 — >p k+1 ■ ■ ■ 1 — ^Pfe+i (A- 3 Gy) of length r, such that ^4 r does not satisfy M, then there exists (A' , G') 
such that G 1 — >* Pk (A 1 , G') and A' does not satisfy M. 

Let us consider the derivation (Aq,Gq) i — >p fc+1 ••• 1 — >P fc+ i (-4 S ,G S ) of length s, such that A; does 
not satisfy M. 

If s = then G is (A S ,G S ) and (Aq,Gq) i — >p (^4<j,G<j) where A; does not satisfy M. 
If s > then we may assume Ao^true, and we have the following cases. 

Case 1: Aq is the equation ti =t 2 . Thus, by Point (1) of the operational semantics of Section f2.3[ the 
derivation from (A; Go) to (A s , G s ) using Pk+i is of the form: 

(A ,G ) 1 — > Pk+1 G mgu(t 1 ,t 2 ) 1 — > Pk+1 . . . 1 — (A S ,G S ) 

By the inductive hypothesis there exists (A',G') such that Go mgu(ti,t 2 ) 1 — >* P (A',G') and A' does 
not satisfy M. Thus, (A , G ) 1 — ^ (A', G'). 
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Case 2: Ao is the disequation t\^t 2 . The proof proceeds as in Case 1, by using Point (2) of the 
operational semantics and the inductive hypothesis. 

Case 3: Ao is a non-basic atom which satisfies M. (The case where Aq does not satisfy M is subsumed 
by the case s = 0.) By Point (3) of the operational semantics, the derivation from (Aq, Go) to (A s , G s ) 
using Pk+i is of the form: 

(A), Go) ^P k+1 (bd{E),G )mgu{A ,hd{E)) ^ Pfc+1 ■■■^p k+1 {A S ,G S ) 
where E is a renamed apart clause in Pk+i- 

If E G Pk then (Aq,G$) i — >p k (bd(E), Go)mgu(Ao, hd(E)) and the thesis follows directly from the 
inductive hypothesis. 

Otherwise, if E G (Pk+i — Pk)> we prove that: 

there exists a goal (A t , Gt) such that (Ao, Go) i — >p fc (At, Gt) and A does not satisfy M (f) 
by considering the following cases, corresponding to the rule which is applied to derive E. 

Case 3.1: E is derived by applying the definition introduction rule. Thus, E G Defs n and (f) follows 
from the inductive hypothesis and the hypothesis that Po U Defs n satisfies M. 

Case 3.2: E is derived by unfolding a clause C in P^ of the form H <— D,Gi, A, G2, where Z? is a 
conjunction of disequations, w.r.t. the non-basic atom A. By Proposition ^ we may assume that 
no disequation occurs in G\,A, G 2 . Let Ci, . . . , C m , with m > 0, be the clauses of such that, 
for all i G {1, . . . , m} A is unifiable with the head of Cj via the mgu #j. 

Thus, E 1 is of the form (H <— D,Gi, bd(C{), G 2 )$i, for some i G {1, . . . , m}, and the derivation 
from (Aq, Gq) to G s ) using P^+i is of the form: 

(A ,Go) ^ Pk+1 ((D,G 1 ,bd(C i ),G 2 )$ i ,Go)vi ^p k+1 ■ ■ ■ ^p k+1 (A S ,G S ) 

where rji is an mgu of Ao and Hfii. By the inductive hypothesis there exists (A',G') such that 
A 1 does not satisfy M and: 

((D,G l ,bd(C i ),G 2 )$ i ,Go)vi ^* Ph (A',G>) 
Since #j is mgu(A, hd(Ci)), $i is relevant, and vars(Go) n uars((A, hd(Ci))) = 0, we have that: 

(Ad, bdiC&G^GoWiTH ^* Pk (A',G') 
and thus, by the definition of the operational semantics (Point 1), we have that: 

(A = hd(C i ),Ao = H,D,G 1 ,bd(C i ),G 2 ,G ) ^* Pk (A', G') 
Then, by properties of mgu's, we have that: 

(A = H,A = hd(C i ),D,G 1 ,bd(C i ),G 2 ,Go) ^* Pk (A', G') 

Since Aq satisfies M, C is safe, and Ci is renamed apart, we have that vars(D mgu(Ao, H)) n 
vars(A,hd(Ci)) = 0. Thus, (D mgu(Ao,H) mgu(A mgu(A , H), hd(d))) = (D mgu(A ,H)) 
and we have that: 

(Ao = H,D,A = hd(C i ),G 1 ,bd(C i ),G 2 ,Go) ^\ (A>,G>) 
Now, by Lemma 12 there exists a goal (A", G") such that: 

(A = H,D,G l ,A = hd(C t ),bd(C i ),G 2 ,G ) — ^ (A" ,G") 
where A" is a non-basic atom which does not satisfy M. There are two cases: 

Case A. (Ao = H,D,G\) 1 — >* Pk (A",G"') for some goal G'" . In this case, by using clause 
C £ Pk, we have that: 

(A ,Go) ^ Pk (D,G 1 ,A,G 2 ,Go)mgu(Ao,H) ^* Pk (A",G"") 
for some goal G"" . 
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Case B. There is no (A'",G'") such that (A = H,D,d) i — >* Pk (A'",G m ) and A" 1 does not 
satisfy M. In this case (Aq = H, D,Gi, A = hd(Ci)) succeeds in P^. It follows that, for some 
substitution 

(A =H, D, G u A = hd(Ci), bd(Ci),G 2 , G ) 

, — ^ (A=/id(C i ),6d(C i ),G 2 ,Go)^ (by Lemma HJ 

^ Pk (bd(C t ),G 2 ,G )$ mgu(A$,hd(Ci)) 

(because mgu's are relevant and C{ is renamed apart) 

^* Pk {A", G"") 
for some goal G"" . Thus, 
(A = H,D,G 1 ,A,G 2 ,G ) 

— ^ (A,G 2 ,G )$ 

^ Pk (bd(Ci),G 2 ,G )$ mgu{A§,hd{Ci)) 
^ {A",G"") 
and therefore, by using clause C € 

(A), Go) ^ {A",G"") 
where A" is a non-basic atom which does not satisfy M. Thus, (f ) holds. 

Case 3.3: E is derived by a safe application of the folding rule (see Definition [SJ) . In particular, 
suppose that from the following clauses in P^: 

d.H<- G 1 ,(A 1 ,K 1 )$,G 2 

C m - H <— Gi,(A m ,K m )'&,G 2 
and the following definition clauses in Defs^: 
D 1 . newp(X 1 , . . . ,X h ) «- A 1 ,K 1 

D m - newp(X 1 , X h ) <- A m , K m 

we have derived the clause E of the form: 

E. H ^G 1 ,newp(X 1 ,...,X h )$,G 2 

where Property £ of Definition [5] holds, that is, each input variable of newp(Xi, . . . is 
also an input variable of at least one of the non-basic atoms occurring in (H, Gx, Ai$, . . . , A m $)- 

Thus, the derivation from (Aq, Go) to (A s , G s ) using Pk+i is of the form: 

(Aq,G ) ^p k+1 (G 1 ,newp(X 1 ,...,X h )#,G 2 ,Go)mgu(Ao,H) ^* Pk+1 (A S ,G S ) 

By the inductive hypothesis, there exists a goal (A', G') such that A' does not satisfy M and the 
following holds: 

(Gi,nevjp(X 1 ,...,X h )0,G 2 ,G o )mgu{A o ,H) ^* Pk (A',G') 

There are two cases: 

Case A: Gimgu(Ao, H) i — >p k (A',G") for some goal G" . In this case we have that, for some 
i E {1, . . . , to}, and for some goal G'" , 

(A Q , G ) i — > Pk (Gi, {Ai,Ki)-d, G 2 , Go)mgu(A , H) (by using clause C t in P k ) 

^ Pk (A',G'") 

Thus, (f) holds. 

Case B: There is no (A",G") such that Gimgu(A ,H) i — >* Pk (A",G") and A" does not satisfy 
M. In this case Gimgu(Ao, H) succeeds in P^, and thus, for some substitution a, 
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(A ,G ) ^* Pk (new P (Xi,...,X h )ti,G 2 ,G )a^* Ph (A',G') 
By Property X, we have that newp(X\, . . . , Xhj'&ct satisfies M. 

It can be shown the following fact. Let us consider the set of all definition clauses with head 
predicate newp in Defs k , for any k G {0, . . . , n}: 

newp(X 1 , X h ) <- Body! 

< 

newp(Xi, X h ) <- Body m 

If for a substitution (3 and a goal G, the atom newp(X\, . . . , Xh)/3 satisfies M and 
newp(Xi, . . . , Xh)P, G i — >* P (A',G'), where A' is a non-basic atom which does not sat- 
isfy M, then for some i G {1,...,to} we have that there exists a goal (^4t,Gj) such that 
Body^, G i — >p fe (At, Gt), where A t is a non-basic atom which does not satisfy M. 

By using this fact, we have that, for some i G {1, . . . , to}, 

(A), Go) — >^ ((^,^)tf,G 2 ,G )a (A t ,G t ) 

where At is a non-basic atom which does not satisfy M and thus, (f) holds. 

Case 3.4: E is derived by applying the head generalization rule. In this case (f) follows from the 
inductive hypothesis and from the definition of the operational semantics (Point 1). 

Case 3.5: E is derived by safe case split (see Definition E|) from a clause C in P^. By Proposition^ 
we may assume that C is of the form: H <— D,B, where D is a conjunction of disequations and 
in B there are no occurrences of disequations. Thus, E is of one of the following two forms: 

Ci. (H^D,B){X/t) 
C 2 . H <- X^t,D,B 

where X is an input variable of H, X does not occur in t, and for all variables Y G vars(t), 
either Y is an input variable of H orY does not occur in C . 

Case A: E is C\. Thus, the derivation from (^4o,Go) to (A S ,G S ) using Pk+i takes the form: 

(A ,G ) ^ Pk+1 ((D,B){X/t},G )mgu(A ,H{X/t}) ^* Pk+i (A S ,G S ) 

By the inductive hypothesis, there exists a goal (A', G') such that A 1 does not satisfy M and the 
following holds: 

((D,B){X/t},G )mgu(A ,H{X/t}) ^* Pk (A',G') 
By properties of mgu's and Point (1) of the operational semantics, we have that: 

A = H, X = t, D, B, G ^* Pk (A',G r ) 
By the conditions for safe case split, we have that: 

vars((X = t) mgu(A , H)) n vars((D, B, G ) mgu(A , H)) =0 
and therefore: 

A = H, D, B, Go ^* Pk (A',G') 
Thus, by using clause C G Pk, 

(A ,G ) ^ Pk (D, B,Go)mgu(A ,H) ^* Pk (A', G') 
and (f) holds. 

Case B: E is C 2 - Thus, the derivation from (Ao,Go) to (A S ,G S ) using Pk+i takes the form: 

(Ao,Go) ^p k+1 (X^t,D,B,Go)mgu(Ao,H) (A S ,G S ) 
By the inductive hypothesis, there exists a goal (A' , G') such that A' does not satisfy M and: 

(X^t,D,B,G )mgu(A ,H) ^* Pk (A',G>) 

Since the answer substitution for any successful disequation is the identity substitution, we have 
that: 
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(D,B,G )mgu(A ,H) ^\ (A',G') 
Thus, by using clause C £ Pj, we have that 

(Ao,G ) ^* Pk (A',G>) 
and (f) holds. 

Case 3.6: E is derived by applying the equation elimination rule. In this case (f) is a consequence of 
the inductive hypothesis, Point (1) of the operational semantics, the safety of P^, and Lemma|21 

Case 3.7: E is derived by applying the disequation replacement rule. In this case (f) is a conse- 
quence of the inductive hypothesis, Point (2) of the operational semantics, and the properties of 
unification. □ 

From Lemma |21 and Definition |2 we have the following proposition. 

Proposition 4 (Preservation of Modes) Let Po, . . . , P n be a transformation sequence constructed 
by using the transformation rules Let M be a mode for Pq U Defs n such that: (i) Po U Defs n 
is safe w.r.t. M, (ii) Pq U Defs n satisfies M, and (iii) the applications of the unfolding, folding, head 
generalization, and case split rules during the construction of Pq, . . . , P n are safe w.r.t. M. Then, for 
k = 0, . . . , n, the program P^ satisfies M. 

A3. Partial Correctness 

For proving the partial correctness of the transformation rules w.r.t. the operational semantics (that 
is, Proposition [5J) , we will use the following two lemmata. 

Lemma 4 Let P be a safe program w.r.t. mode M, let Eqs be a conjunction of equations, and let G\ 
be a goal without occurrences of disequations. For all goals G2, if 

(Eqs, G x , G 2 ) G 2 $ 
then either 

(G 1} Eqs, G 2 ) G 2 tf 
or there exists a goal (A', G') such that A' is a non-basic atom which does not satisfy M and 

Gi —J, (A',C). 

Lemma 5 Let Pq, . . . ,P n be a transformation sequence constructed by using the transformation rules 
Let M be a mode for Pq U Defs n such that: (i) Po U Defs n is safe w.r.t. M, (ii) Pq U Defs n satisfies 
M, and (iii) the applications of the unfolding, folding, head generalization, and case split rules during 
the construction of Po, . . . ,P n are all safe w.r.t. M. 

Then, for k = 0, . . . ,n — 1, for each goal G, if there exists a derivation G 1 — > p k+1 ■ ■ ■ 1 — *Pk+i ^ rue 
which is consistent with M, then G 1 — >p feU£)e j s true, that is, G succeeds in P^ U Defs n . 

Proof: By hypotheses (i— iii) , and Propositions El and |1J for k = 0, . . . ,n, program P& is safe and 
satisfies M. Let G be a goal of the form (Aq, Gq), such that there exists a derivation 

5 : (A , Go) 1 — > Pk+1 . . . 1 — > Pk+1 true 
which is consistent with M. We will prove that: 

(A ,Go) 1 — >* PkUDefSn true 
The proof proceeds by induction on the length s of the derivation S. 

Base Case. For s = 0, the goal (^4o ; Go) is true and the thesis follows from the fact that true succeeds 
in all programs. 

Step Case. Let us now assume the following 

Inductive Hypothesis: for all r < s and for all goals G, if there exists a derivation G 1 — y p k+1 ■ ■ ■ 1 — > Pk+i 
true of length r which is consistent with M, then G 1 — >p k {jDefs t rue - 
There are the following three cases. 
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Case 1: Aq is the equation t\ = t 2 . By Point (1) of the operational semantics of Section l2.3[ the 
derivation 5 is of the form: 

(ti = t 2 ,G ) i — >p k+1 G mgu(t 1 ,t 2 ) i — >p k+1 . . . i — >p k+1 true 

Thus, the derivation Go mgu(tx, t 2 ) 1 — > P k+1 ■■■ 1 — > Pk+i true has length s — 1 and it is consistent 
with M. By the inductive hypothesis there exists a derivation Gq mgu(ti,t 2 ) 1 — >p fc true. Thus, 
(Aq, Go) i — >p k true and (Ao, Gq) succeeds in Pk U Defs n . 

Case 2: Aq is the disequation t\^t 2 . The proof proceeds as in Case 1, by using Point (2) of the 
operational semantics and the inductive hypothesis. 

Case 3: Aq is a non-basic atom which satisfies M (otherwise there is no derivation starting from 
(Aq,Gq) which is consistent with M). By Point (3) of the operational semantics, the derivation 8 is 
of the form: 

(A), Go) 1 — >P k+1 (bd(E),G )mgu(A ,hd(E)) i — > Pk+1 ... i — iree 
where E 1 is a renamed apart clause in Pfc+i- 

If £7 E then (Aq,Gq) i — >p k (bd(E),Go)mgu(Ao,hd(E)) and the thesis follows directly from the 
inductive hypothesis. 

Otherwise, if E G (-Pfc+l — Pk)-, we prove that (Aq,Gq) succeeds in Pk U Defs n by considering the 
following cases, which correspond to the rules applied for deriving E. 

Case 3.1: E is derived by applying the definition introduction rule. Thus, E is a clause in Defs n of 
the form: newp(Xi, . . . , X^) <— B and the derivation 5 is of the form: 

(newp(t 1 ,...,t h ),G ) i — > DefSn (B{X 1 /t 1 , . . . , X h /t h },G ) i — > Pfc+1 ...i — > Pfe+1 irae 
By the inductive hypothesis, we have that: 

(B{X 1 /t u ...,X h /t h },G ) ^* Pk true 
and thus, 

(newp(tt, t h ), G ) i — >p kUD efs n true 

Case 3.2: E is derived by unfolding a clause G in Pk of the form H <— D,G\, A,G 2 , where is a 
conjunction of disequations, w.r.t. the non-basic atom A. By Proposition ^ we may assume that 
no disequation occurs in Gi, A, G 2 . Let C\, . . . , C m , with m > 0, be the clauses of Pk such that, 
for all i G {1, . . . , m} ^4 is unifiable with the head of Gj via the mgu 

Thus, E is of the form (H <— D,G\, bd(Ci), G 2 )<&i, for some i 6 {1, . . . , m}, and the derivation 
5 is of the form: 

(Aq, Go) ^p k+1 ((D,G 1 ,bd(Ci),G 2 )^i,G )TH ^p k+1 ■■■^>p k+1 true 
where rji is an mgu of Aq and H"&i. By the inductive hypothesis we have that: 

((D,G 1 ,bd(C i ),G 2 )# i ,G )r H ^* Pk true 
Since #j is mgu(A, hd(Ci)), #j is relevant, and vars(Go) n ?;ars((j4, hd(G{))) = 0, we have that: 

(D,G x ,bd{Ci),G2,G Q )#irH ^p k true 
and thus, by the definition of the operational semantics (Point 1), we have that: 

(A = M^),A) = #,AGi,M(G),G 2 ,G ) ^ Pk true 
Then, by properties of mgu's, we have that: 

(Ao = H,A = hd(C i ),D,G 1 ,bd(C i ),G 2 ,Go) ^p k true 

Since Aq satisfies M, C is safe, and C\ is renamed apart, we have that vars(D mgu(AQ,H)) n 
vars(A,hd(Ci)) = 0. Thus, (D mgu(A ,H) mgu(A mgu(A , H), hd(d))) = (D mgu(A ,H)) 
and we have that: 

{Ao=H,D,A=hd(C i ),Gi,bd(C i ),G 2: Go) ^* Pk true 
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Now, by Lemma EJ there are the following two cases. 

Case A. (A = H, D, G u A = hd(d), bd(d), G 2 , Go) ' — >* Pk true 

In this case, by Points (1) and (3) of the operational semantics we have that: 

(A = H,D,G 1 ,A,G 2 ,G ) ^* Pk true 
and thus, by using clause C in P^, 

(A , G ) i — >* Pk true 
Case B. There exists a goal (A', G') such that: 

(A =H,D,Gt) ^ (A',G') 

where A' is a non-basic atom which does not satisfy the mode M . In this case we have that, for 
some goal G", 

A — >^ (A',G") 

which is impossible because Aq and Pt satisfy M, 

Case 3.3: E is derived by a safe application of the folding rule (see Definition EJ) . In particular, 
suppose that from the following clauses in P^: 

d.H<- G 1 ,(A 1 ,K 1 )^,G 2 

C m - H <— Gi,(A m ,K m )'d,G 2 
and the following definition clauses in Defs k '- 
D 1 . newp(X 1 , . . . , X h ) «- A\,K\ 

D m . newp(X 1 , X h ) <- A m , K m 
we have derived the clause E of the form: 
E. H *-G 1 ,newp(X 1 ,...,X h )-&,G 2 

where Property S of Definition El holds, that is, each input variable of newp(X\, . . . , is 
also an input variable of at least one of the non-basic atoms occurring in (H, G\, Ax$, . . . , A m -d). 

Thus, the derivation 5 is of the form: 

(Ao,G ) i — >p k+1 {Gi,newp(X 1 ,. . . ,X h )i9,G 2 ,G )mgu(A ,H) i — >* Pk+i true 

By the inductive hypothesis, the following holds: 

(Gi, newp(X 1 , . . . , X h )i),G 2 ,G )mgu(A , H) i — >* Pk true 

and therefore, for some substitution a, 

(A ,G ) i — >* Pk (newp(X 1 , . . . ,X h )$,G 2 ,G )a i — >* Pk true 

By Property S, we have that newp(X\, . . . , X^fia satisfies M. 

It can be shown the following fact. Let us consider the set of all definition clauses with head 
predicate newp in Defs k , for any k G {0, . . . , n}: 

newp(X 1 , X h ) <- Body 1 

newp(X 1 , X h ) <- -Borfy m 

If for a substitution (5 for a goal G, the atom newp{X\, . . . , Xh)(3, G satisfies M and we have that 
newp(Xi, . . . , Xh)P, G i — >p fe irae, then for some i G {1, . . . , m} we have that Bodyfl i — >p fe iree. 

By using this fact, we have that, for some i G {1, . . . , m}, 

(A ,G ) ^* Pk ({A i ,K i )^G 2 ,G )a ^p k true 
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Case 3.4: E is derived by applying the head generalization rule. In this case (Aq,Gq) i — >* Pk true 
follows from the inductive hypothesis and from the definition of the operational semantics (Point 

!)• 

Case 3.5: E is derived by safe case split (see Definitional) from a clause C in Pfc. By Proposition ^ 
we may assume that C is of the form: H <— D,B, where D is a conjunction of disequations and 
in B there are no occurrences of disequations. Thus, E is of one of the following two forms: 

Ci. (H ^D,B){X/t} 
C 2 . H <- X^t,D,B 

where X is an input variable of H, X does not occur in t, and for all variables Y € vars(t), 
either Y is an input variable of H orY does not occur in C. 

Case A: E is C\. Thus, the derivation 5 takes the form: 

(4,, Go) ^p fe+1 ((A£){X/t},Go)m 5 K(A 0)J ff{X/i}) ^p fc+1 * rae 
By the inductive hypothesis, we have that: 

{(D,B){X/t},G )mgu{A ,H{X/t}) ^* Pk true 
By properties of mgu's and Point (1) of the operational semantics, we have that: 

(A = H, X = t, D, B, G ) i — >* Pk true 
By the conditions for safe case split, we have that: 

vars((X = t) mgu(A ,H)) n vars((D, B, G ) mgu(A ,H)) = 
and therefore: 

(A =H, D, B, G ) ^* Pk true 
Thus, by using clause C € Pk, 

(A ,G ) ' — > Pk (D, B, Go)mgu(A ,H) i — >* Pk true 
Case B: E is C%- Thus, the derivation 5 takes the form: 

(A ,G ) ^p k+1 (X^t,D,B,G )mgu(Ao,H) ^* Pk+1 true 
By the inductive hypothesis, we have that: 

(X^t,D,B,Go)mgu(A ,H) i — >* Pk true 

Since the answer substitution for any successful disequation is the identity substitution, we have 
that: 

(D,B,G )mgu(A ,H) i — >* Pk true 
Thus, by using clause C € Pk, 
(A , G ) i — >* Pk true 

Case 3.6: E is derived by applying the equation elimination rule. In this case (Aq, Go) i — >* Pk true is 
a consequence of the inductive hypothesis, Point (1) of the operational semantics, the fact that 
Pk is safe and satisfies M, and Lemma 0] 

Case 3.7: E is derived by applying the disequation replacement rule. In this case (Aq, Go) i — > Pk true 
is a consequence of the inductive hypothesis, Point (2) of the operational semantics, and the 
properties of unification. □ 

Proposition 5 (Partial Correctness) Let Po, ■ ■ ■ ,P n be a transformation sequence constructed by 
using the transformation rules Let M be a mode for Po U Defs n such that: (i) Po U Defs n is 

safe w.r.t. M, (ii) Po U Defs n satisfies M, and (iii) the applications of the unfolding, folding, head 
generalization, and case split rules during the construction of Po, . . . ,P n are all safe w.r.t. M. 
Then, for k = 0, . . . , n, for each non-basic atom A which satisfies mode M, if A succeeds in P^ then 
A succeeds in Pq U Defsk ■ 
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Proof: Suppose that a non-basic atom A which satisfies M has a successful derivation using P&. By 
Proposition^ P^ satisfies M and, therefore, A has a successful derivation using P& which is consistent 
with M. Thus, the thesis follows from Lemma 03 □ 



A4. Completeness 

For the proofs of Propositions |3] (Preservation of Safety), 0] (Preservation of Modes), and [21 (Partial 
Correctness) , we have proceeded by induction on the length of the derivations and by cases on the rule 
used to derive program Pk+i from program P&. For the proof of Proposition El below (Completeness), 
we will proceed by induction w.r.t. more sophisticated well-founded orderings. This proof technique 
is a suitable modification of the one based on weight consistent proof trees |14| 146 j. 

The following definition introduces some well-founded orders and other notions which are needed 
for the proofs presented in this section. 

Definition 14 (i) Given a derivation 5 of the form Go i — >p G\ i — >p . . . i — >p G z , we denote by A(<5) 
the number of goals Gj in 5 such that Gi is of the form (A, K) where A is a non-basic atom, 
(ii) We define the following functions /i and v which given a program and a goal return either a 
non- negative integer or oo (we assume that, for all non- negative integers n, oo > n): 

/ p n \ _ ) min{A(<5) | 5 is a successful derivation of G in P} if G succeeds in P 
| oo otherwise 



min{n | n is the length of a successful derivation of G in P} if G succeeds in P 
oo otherwise 



(hi) Given a program P and two goals G\ and G2, we write G\ >-p G2 iff ^(P, Gi) > fi(P, G2). 
Similarly, we write G\ G2 iff /i(P, Gi) > fi(P, G2). 

(iv) Given two programs P and Q, we say that a derivation Go 1 — >p G\ 1 — >p . . . 1 — >p G z is quasi- 
decreasing w.r.t. >-q iff for i = 0, . . . , z — 1, either (1) Gj >~q Gj+i or (2) the leftmost atom of G« is a 
basic atom and Gj Gj + i. 

(v) Let P a program and Gi,G2 be goals. If there exists a derivation S from G\ to G2 such that 
X(5) = s, then we write G\ 1 — >p G2. 

For any program P the relation is a well-founded order and, for all goals Gi,G2, and G3, we 
have that G\ >-p G2 and G2 <^p G3 implies G\ >~p G3. 

Lemma 6 Let P be a program and G be a goal. If G succeeds in P then G has a derivation which 
is quasi-decreasing w.r.t. >-p. 

Proof: The derivation 5 from G using P such that A(5) < X(S') for all successful derivations 5' from 
G, is quasi-decreasing w.r.t. yp. □ 

Lemma 7 Let M be a mode for program P, such that P is safe w.r.t. M and P satisfies M. Let Pgs 
be a conjunction of equations, and Go,Gi,G2 be goals. Suppose also that no disequation occurs in 
G\ and all derivations from the goal (Go,Gi) are consistent with M. Then: 

(i) (G , Gi, Eqs, G 2 ) 1 — >* P true iff (G , Eqs, G x , G 2 ) 1 — > P trae 

(ii) (Go, Gi, £?ga, G 2 )) = m(P, (G , ^s, G x , G 2 )) 

(iii) v(P, (G , Gi, G 2 )) = u(P, (G , ^s, Gi, G 2 )) 

Proof: By induction on the length of the derivations. □ 

Lemma 8 Let M be a mode for program P, such that P is safe w.r.t. M and P satisfies M. Let 
i? be a substitution and Go,Gi,G2 be goals. Suppose also that no disequation occurs in G2 and all 
derivations from the goal (Gq,G2) are consistent with M. Then: 
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(i) if (G , Gi, G 2 )$ i — >* P true then (G , G 2 ) ' — >J> irue 

(ii) M (P, (G o ,Gi,G a )0) > MP, (G ,G 2 )) 

(iii) (G o ,Gi,G 2 )0) > i/(P, (G ,G 2 )) 

Proof: By induction on the length of the derivations. □ 

Lemma 9 Let M be a mode for program P, such that P is safe w.r.t. M and P satisfies M. Let 
Diseqs be a conjunction of disequations and G be a goal. Suppose also that vars (Diseqs) C\vars(G) = 0. 
Then: 

(i) (G, Diseqs) i — >p £ree iff (Diseqs, G) i — >p irae 

(ii) p(P, (G, Diseqs)) = fi(P, (Diseqs, G)) 

(iii) u(P, (G, Diseqs)) = v(P, (Diseqs, G)) 

Proof: The proof proceeds by induction on the length of the derivations. □ 

Let us consider a transformation sequence Pq, . . . ,P n constructed by using the transformation 
rules according to the hypothesis of Theorem H3 For reasons of simplicity we assume that each 
definition clause is used for folding, and thus, by Condition ^ of Theorem it is unfolded during the 
construction of Pq, . . . , P n . We can rearrange the sequence Po, . . . , P n into a new sequence Po, . . . , Pq U 
Defs n , . . . , Pj, . . . , P[, . . . , P n such that: (1) Pq, . . . , Po U Defs n is constructed by applications of the 
definition introduction rule, (2) Po U Defs n , ■ ■ ■ ,Pj is constructed by unfolding every clause in Defs n , 
(3) Pj, . . . , P is constructed by applications of rules and (4) either I = n or / = n — 1 and P n is 
derived from P n _i by an application of the definition elimination rule w.r.t. predicate p. 

Throughout the rest of this section we will refer to the transformation sequence Po , . . . , Po U 
Defs n , . . . , Pj, . . . , P n constructed as indicated above. We also assume that M is a mode for Po U Defs n 
such that: (i) Po U Defs n is safe w.r.t. M, (ii) Pq U Defs n satisfies M, and (iii) the applications of the 
unfolding, folding, head generalization, and case split rules during the construction of Po, . . . ,P n are 
all safe w.r.t. M. 

Thus, by Propositions and 21 for k = 0, ... ,n, program P& is safe and satisfies M. 

Lemma 10 Let us consider the transformation sequence Po, . . . , Po U Defs n , ■ ■ ■ ,Pj constructed as 
indicated above. Then the following properties hold. 

(i) For all clauses newp(X±, . . . , Xh) <— Body in Defs n , for all substitutions ■&, and for all goals G\, G 2 , 
such that all derivations from (G\, Body®, G 2 ) using Pj are consistent with M, we have that: 

(1.1) (Gi, Bodyti, G 2 ) y Pj (Gi, newp(X 1 , . . . ,X h )d, G 2 ); 

(1.2) all derivations starting from (G\, newp(X\, . . . G 2 ) using Pj are consistent with M; 

(ii) for all non-basic atoms A satisfying M, if A succeeds in Po U Defs n then A succeeds in Pj. 

Notice that, by Point (i.l), if (Gi, Bodyd, G 2 ) succeeds in Pj then (G\, newp(X\, . . . , Xhjfi, G 2 ) 
succeeds in Pj. 

Proof: By induction on the length of the derivations. □ 

For the proof of the following Lemma IT21 we will use the following property. 

Lemma 11 Let us consider the transformation sequence Pj, . . . ,P and the mode M for Pq U Defs n 
as indicated above. For k = j, . . . ,1 and for all goals G\ and G 2 such that there exists a derivation 
Gi i — >p k . . . i — >p k G 2 , if all derivations from G\ using Pj are consistent with M then all derivations 
from G 2 using Pj are consistent with M. 

Proof : The proof proceeds by induction on k and on the length of the derivation G\ i — > p k ■ ■ ■ i — ► p k 
G 2 . We omit the details. □ 
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Lemma 12 Let us consider the transformation sequence Pj, . . . ,Pi and the mode M for Pq U Defs n 
as indicated above. Let G be a goal such that (i) no disequation occurs in G and (ii) all derivations 
from G using Pj are consistent with M . For k = j, . . . ,1, if G has a successful derivation in Pj, then 
G has a successful derivation in P^ which is quasi-decreasing w.r.t. >- Pj - 

Proof: Let us consider the following ordering on goals: 

Gi > G 2 iff either G x y P] G 2 or G x hp, G 2 and v{Pj,G x ) > u(Pj,G 2 ). 

> is a well-founded order. 

The proof proceeds by induction on k. 

Base Case. The case k = j follows from Lemma |HJ 

Step Case. For k > j we assume the following: 

Inductive Hypothesis (II). For each goal G' such that no disequation occurs in G' and all derivations 
from G' using Pj are consistent with M, if G 1 has a successful derivation in Pj, then G' has a successful 
derivation in which is quasi-decreasing w.r.t. ^p r 

Let us now consider a goal G of the form (Aq, Go) such that no disequation occurs in (Aq, Go) and all 
derivations from (^4o> Go) using Pj are consistent with M. Let us assume that there exists a derivation 
of the form: 

5 : (A , Go) i — > Pk . . . i — > Pk true 

which is quasi-decreasing w.r.t. y Pj - 

We wish to show that there exists a derivation of the form: 

8' : (Aq, G ) i — >p k+1 . . . i — >p k+1 true 
which is quasi-decreasing w.r.t. y Pj - We prove the existence of such a derivation 5' by induction on 
the well-founded order [>. 
We assume the following: 

Inductive Hypothesis (12). For each goal G such that no disequation occurs in G and all derivations 
from G using Pj are consistent with M and (Ao, Go) > G, if there exists a derivation of the form: 

G i — > Pk . . . i — > Pk true 
which is quasi-decreasing w.r.t. >-pp then there exists a derivation of the form: 

G i — > Pk+1 . . . i — > Pk+1 true 
which is quasi-decreasing w.r.t. >-Pj- 
Now we proceed by cases. 

Case 1: Aq is the equation t\ =t 2 . By Point (1) of the operational semantics of Section 12.31 the 
derivation S is of the form: 

(h = t 2 , Gq) i — >p k Go mgu(ti,t 2 ) i — > Pk . . . i — > Pk true 

Let us consider the derivation: 

G mgu(h,t 2 ) i — > Pk . . . i — > Pk true 

By Proposition we have that both (t± = t 2 , Go) and Go mgu(t\,t 2 ) succeed in Pj. Moreover, by Point 
(1) of the operational semantics v(Pj, (t± = t 2 ,Go)) > v(Pj, Go mgu{t\, t 2 )). Thus, (ii = ^Go) O 
Go mgu{t\,t 2 ) and, by the inductive hypothesis (12), there exists a successful derivation of the form: 

G mgu(t 1 ,t 2 ) i — > Pk+1 ■ ■ ■ i — > Pk+1 true 
which is quasi-decreasing w.r.t. >~ Pj - Since (t\ = t 2 ,Go) h Pj Go mgu(t\,t 2 ), the following derivation: 

{ti = t 2 ,G ) i — > Pk+1 G mgu(ti,t 2 ) i — > Pk+1 . . . i — > Pk+1 true 
is quasi-decreasing w.r.t. >- Pj . 

Case 2: Aq is a non-basic atom which satisfies M (otherwise there is no derivation starting from 
{Aq,Go) which is consistent with M). By Point (3) of the operational semantics, in P^ there exists a 
renamed apart clause G, such that the derivation 5 is of the form: 

(A ,G Q ) i — > Pk (bd(C),Go)mgu(A , hd(C)) i — > Pk ... i — > Pfc trwe 
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By Proposition ^ we may assume that clause G is of the form H <— Diseqs, B, where Diseqs is a con- 
junction of disequations and B is a goal without occurrences of disequations. Thus, Diseqs mgu(Ao, H) 
succeeds and 5 is of the form: 

(Aq,G ) i — >p k (Diseqs, B, G ) mgu(A ,H) i — > Pk ... i — > Pk (B,Go)mgu(A , H) i — > Pk ... \ — > Pk 
true 

If C G P k+ i then (A), G ) 1 — >P fc+1 [Diseqs, B, G )mgu(A , H) \ — > Pk+1 . . . i — > Pk+1 (B, G )mgu(A , H) 
and the thesis follows from the inductive hypothesis (12), because (Aq,Go) >~ P] (B,Go)mgu(Ao, H) 
(recall that 5 is quasi-decreasing w.r.t. >-p-). 

Otherwise, if C € (Pk — P^+i), we construct the derivation 5' by considering the following cases, which 
correspond to the rules applied for deriving Pk+i from Pk- 

Case 2.1: Pk+i is derived by unfolding clause C in Pk w.r.t. a non-basic atom, say A. Thus, clause C 
is of the form H <— Diseqs, G\,A, G 2 . Let Ci, . . . , C m , with m > 0, be the clauses of Pfc such that, 
for i = I, . . . ,m, A is unifiable with the head of Gj. Thus, Pfc+i = (Pk — {G}) U {-Di, . . . , D m }, 
where for i = l,...,m, Di is the clause (H <— Diseqs, G\, bd(Ci), G2) mgu(A,hd(Ci)). For 
reasons of simplicity we assume that for i = 1, ... ,m, no disequation occurs in bd(Ci). In the 
general case where, for some i € {1, . . . ,m}, bd(Ci) has occurrences of disequations, the proof 
proceeds in a very similar way, by using Proposition ^ Lemma E3 and the hypothesis that all 
applications of the unfolding rule are safe (see Definition 0J) . 

The derivation 5 is of the form: 

(A ,G ) 1 — > Pk (Diseqs,Gi,A,G 2 ,G )mgu(A ,H) 1 — > Pk ... 1 — > Pk true 

From the fact that 5 is quasi-decreasing w.r.t. ^p j 7 from Point (1) of the operational semantics, 
and from the definition of ^p j 5 we have that: 

(Ao, G ) y Pj (A = H,Diseqs,G 1 ,A,G 2 ,G ) 
and the derivation 

(A Q = H, Diseqs, G\,A,G 2 , Go) 1 — >P k — >P k true 
is quasi-decreasing w.r.t. >~p r 

Thus, by Points (1) and (3) of the operational semantics, there exists a clause in P^, say Ci, 
such that the derivation 

(A Q = H, Diseqs, Gi,A = hd(Ci),bd(Ci),G 2 , G ) 1 — > Pk — > Pk true 
is quasi-decreasing w.r.t. y P -. Moreover, we have that: 

(A), G ) y Pj (A = H, Diseqs, G 1 ,A = hd(C i ),bd(C i ),G 2 ,G ). 

Since all derivations from (Aq, Go) using Pj are consistent with M, we have that all derivations 
from (Ao = H, Diseqs, G\) using Pj are consistent with M, and therefore, by Lemma El all 
derivations from (A = H,G\) using P^ are consistent with M. Then, since no disequation 
occurs in G\, by Lemma there exists a derivation 

(A = H, Diseqs, A = hd(Ci),Gi,bd(Ci),G 2 , G ) 1 — > Pk — > Pk true 
which is quasi-decreasing w.r.t. y P -. Moreover, we have that: 

(Ao, Go) y Pj (A = H, Diseqs, A = hd(C i ),G 1 ,bd(C i ),G 2 ,Go). 
Now, since by Lemma |21 all clauses in P^ are safe, we have that: 

vars(Diseqs mgu(Ao, H)) n vars((A = hd(Ci))mgu(Ao, H)) = 
and therefore, by using properties of mgu's, there exists a derivation 

(A = hd(Ci), Ao = H, Diseqs, Gi,bd(Ci),G 2 , Go) 1 — > Pk — > Pk true 

which is quasi-decreasing w.r.t. >~ Py Let be mgu(A, hd(C{)) and r\i be mgu(Ao, H i?i)). By 
Points (1) and (2) of the operational semantics, we have that Diseqs "dirji succeeds and there 
exists a derivation of the form 
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((Gi,bd(C i ),G 2 )^i,G )r] i i — > Pk ...i — > Pk true 
Moreover, we have that: 

(A ,G ) y Pj ((Gi,bd{C i ),G 2 )# i ,Go)m (*) 
and thus, by the inductive hypothesis (12), there exists a derivation of the form 

((G 1 ,bd(Ci),G 2 )tfi,G )7]i i — > Pk+1 ... i — > Pk+1 true 

which is quasi-decreasing w.r.t. >-py 

Since Diseqs rji succeeds, by using clause Di in Pk+i for the first step, we can construct the 
following derivation: 

(A ,G ) i — > Pk+1 ((Diseqs, G 1 ,bd(Ci),G 2 )tfi,G )rii i — > Pk+1 ... i — > Pk+1 true 

which, by property (*), is quasi-decreasing w.r.t. >-p y 

Case 2.2: Pk+i is derived from Pk by a safe application of the folding rule (see Definition EJ) . In 
particular, suppose that clause C is one of the following clauses occurring in P^: 

Ci.H*- Diseqs, G u (A l ,K 1 )^, G 2 
C m . H <- Diseqs, d, (A m , K m )<d, G 2 

where Diseqs is a conjunction of disequations and no disequation occurs in (Gi,G 2 ). We also 
suppose that the following definition clauses occur in Defsj-: 

D 1 . newp(X 1 , . . . ,X h ) «- A 1 ,K 1 

D m . newp(X 1 , X h ) <- A m , K m 

and we have derived a clause E of the form: 

E. H <— Diseqs, G\, newp(X\, . . . , X^d, G 2 

where Property S of Definition El holds, that is, each input variable of newp(X\, . . . is 
also an input variable of at least one of the non-basic atoms occurring in (H, Gx^Axfi, . . . , A m -d). 

Thus, P k+1 = (P k - {Ci, . . . , C rn }) U {E}. 

We may assume, without loss of generality, that clause C is C\, and the derivation 5 is of the 
form: 

(A ,G ) i — > Pk (Diseqs, G 1 ,(A 1 ,K 1 )tf,G 2 , G ) mgu(A ,H) i — > Pk ... i — > Pk true 

Thus, Diseqs mgu(AQ, H) succeeds and, since 8 is consistent with M, by LemmaEl we have that 
(G±, (A\, Ki)i}, G 2 , Go)mgu(Ao, H) succeeds in Pj. 

Moreover, by Lemma ITT1 all derivations from (Gi, (A\, Kx)$, G 2 , Go)mgu(Ao, H) using Pj are 
consistent with M. 

Thus, by Lemmata 151 and HUl all derivations from (G\,newp(X\, . . . , Xt l )$,G 2 ,Go)mgu(Ao, H) 
using Pj are consistent with M and there exists a derivation of the form: 

(Gi, newp(X 1 , . . . , X h )i}, G 2 , G )mgu(A Q , H) \ — > Pj . . . i — > P . true 

which is quasi-decreasing w.r.t. >-p r 

No disequation occurs in (G\, newp(X\, . . . , X^)i9, G 2 , Go)mgu(Ao, H), and thus, by the induc- 
tive hypothesis (II), there exists a derivation of the form: 

(Gi, newp(X 1 , . . . , X h )i}, G 2 , G )mgu(A Q , H) \ — > Pk . . . i — > Pk true 

which is quasi-decreasing w.r.t. >-p r 

Since 5 is quasi-decreasing w.r.t. >-Pj, by Lemma fTUl we also have that: 
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(A ,G ) > (G 1 ,newp(X 1 ,...,X h )#,G 2 ,G )mgu(A ,H) 
Thus, by the Inductive hypothesis (12), there exists a derivation 

(Gi, newp(X 1 , . . . , X h )i}, G 2 , G )mgu(A Q , H) \ — > Pk+1 . . . i — > Pk+1 true 
which is quasi decreasing w.r.t. >-p r 

Since Diseqs mgu(Ao, H) succeeds, by using clause E G Pk+i, we can construct the following 
derivation 

(A ,G Q ) i — > Pk+1 (Diseqs, Gi,newp(X 1 ,...,X h )'&,G 2 ,Go)mgu(A ,H) i — > Pk+1 ... i — > Pk+1 

true 

which is quasi-decreasing w.r.t. >- Pj because: 

(Aq,Gq) >- Pj (Diseqs, Gi,(Ai,Ki)'&,G 2 ,Go)mgu(Ao,H) (because 5 is quasi-decreasing) 
^ Pj (Diseqs, Gi,newp(Xi, ... ,Xh)$,G 2 ,Go)mgu(Ao,H) (by Lemma ITU]) 

Case 2.3: Pk+i is derived by deleting clause C from Pj. by applying the subsumption rule. Thus, 
clause C is of the form (H <— Diseqs, GijG^fi and there exists a clause D in Pk of the form 
H <— Diseqs, G\. By Proposition ^ we may assume that no disequation occurs in G\. 

Thus, the derivation (5) is of the form: 

(A), G ) 1 — >P k ((Diseqs, G 1 ,G 2 )'&, G ) mgu(A ,Hi9) i — > Pfe ... i — > Pfc irite 

Since all derivations starting from (Aq,Go) using Pk are consistent with M and, by using 
clause D, (Aq, Go) i — >p fc (Diseqs, G\, Go)mgu(Ao, H), we have that all derivations starting from 
(Diseqs, G\, Go)mgu(Ao, H) using Pk are consistent with M. Moreover, no disequation occurs 
in Go and therefore, by Lemma |HJ there exists a derivation 

(A Q ,G ) i — > Pk (Diseqs, G 1 ,G )mgu(A Q ,H) i — > Pk ... i — > Pfc £rae 

which is quasi-decreasing w.r.t. >-p-. Thus, (Diseqs mgu(Ao, H)) succeeds and there exists a 
derivation 

(Gi, G )mgu(^o, F) i — . . . i — > Pk true 

which is quasi-decreasing w.r.t. >-p.. Since (^4o,Go) > (G%, Go)mgu(Ao, H), by the inductive 
hypothesis (12), there exists a derivation 

(Gi,G )mgu(A , H) i — > Pfc+1 . . . i — > Pfc+1 irae 

which is quasi-decreasing w.r.t. >-p.. Since D belongs to Pfc+i and (Diseqs mgu(Ao, H)) succeeds, 
there exists a derivation 

(A ,G ) i — > Pk+1 (Diseqs, G±, G )mgu(A , H) i — > Pfc+1 ... i — > Pfc+1 irae 

which is quasi-decreasing w.r.t. >- Pj . 

Case 2.4: P^+i is derived from Pk by applying the head generalization rule to clause G. Thus, G is 
of the form H{X/t} <— i?oc??/ and P^+i = (-Pfc — {C}) U {GenG}, where clause GenC is of the 
form H <— X = t, Body. 

In this case we can show that we can construct the derivation 5' which is quasi-decreasing w.r.t. 
>- Pj , by using (i) Point (1) of the operational semantics, (ii) the inductive hypothesis (12) and 
(hi) the fact that, for all goals of the form (t\ =t 2 , G), where t\ and t 2 are unifiable terms, and 
for all programs P, n(P, (ti=t 2 ,G))=fj,(P, Gmgu(t\,t 2 )). 

Case 2.5: Pk+i is derived from Pk by applying the safe case split rule (see Definition EJ) to clause G. 
By Proposition ^ we may assume that G is a clause of the form H *— Diseqs, B, where Diseqs 
is a conjunction of disequations and B is a goal without occurrences of disequations. We also 
assume that from G we have derived two clauses of the form: 

Gi. (H <- Diseqs, B){X/t} 
G 2 . H <- X + t, Diseqs, B 
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where X is an input variable of H, X does not occur in t, and for all variables Y € vars(t), 
either Y is an input variable of H orY does not occur in C. 

We have that -Pfc+i = (Pfe — {C}) U {C\, C2}. The derivation 5 is of the form: 

(A , Go) 1 — > Pk {Diseqs, B,G ) mgu(A , H) 1 — > Pk . . . 1 — > Pk true 

Thus, (Diseqs mgu(Ao, H)) succeeds and, since 5 is quasi-decreasing, we have that (Aq,Go) > 
(B,Go) mgu(Ao, H). The goal (B,Go) mgu(Ao, H) has no occurrences of disequations and, by 
the inductive hypothesis (12), there exists a derivation 

(B, G ) mgu(A , H) 1 — > Pk+1 . . . 1 — > Pk+1 true 

which is quasi-decreasing w.r.t. >- p . Since (Diseqs mgu(Ao, H)) succeeds, there exists a deriva- 
tion 

(Diseqs, B, Go) mgu(A ,H) 1 — > Pk+1 . . . 1 — > Pk+1 true 
which is quasi-decreasing w.r.t. y P -. 

Since X is an input variable of H, there exists a binding X/u in mgu(Ao, H) where u is a ground 
term. We consider the following two cases. 

Case A: t and u are unifiable, and thus, u is an instance of t. In this case Aq and H{X/t} are 
unifiable and, by the hypotheses on X/t, we have that: 

(Diseqs, B, G ) mgu(A ,H) = ((Diseqs, B){X/t}, G ) mgu(A , H{X/t}) 
Thus, we can construct a derivation of the form: 

(A ,G ) ^p k+1 ((Diseqs, B){X/t}, Go) mgu(A ,H{X/t}) ^ Pk+1 ...^ Pk+1 true 
which is quasi-decreasing w.r.t. y P -. 

Case B: t and u are not unifiable. Thus, (X ^t)mgu(Ao, H) succeeds and the following derivation 
is quasi-decreasing w.r.t. y Pj - 

(A), G ) ' — >p k+1 (X + t, Diseqs, B, G ) mgu(A ,H) 

1 — 'Pk+i (Diseqs, B,G ) mgu(A , H) 1 — > Pk+1 . . . 1 — > Pk+1 true 

Case 2.6: Pk+i is derived from by applying the equation elimination rule to clause C. In this case 
the existence of a derivation 

(A , Go) 1 — > Pk+1 . . . ' — > Pk+1 true 

which is quasi-decreasing w.r.t. >-p., can be proved by using (i) the inductive hypothesis (12), 
(ii) Point (1) of the operational semantics, (iii) the fact that P/% is safe and satisfies M, and (iv) 
Lemma 

Case 2.7: Pk+i is derived from Pj~ by applying the disequation replacement rule to clause C. In this 
case the existence of a derivation 

(A , Go) 1 — > Pk+1 . . . ' — > Pk+1 true 

which is quasi-decreasing w.r.t. >-p., can be proved by using (i) the inductive hypothesis (12), 
(ii) Point (2) of the operational semantics, and (iii) the properties of unification. □ 

Lemma 13 Let us consider the transformation sequence Pj, . . . ,Pi and the mode M for Po U Defs n 
as indicated above. For k = j, . . . ,1, for each non-basic atom A which satisfies mode M, if A succeeds 
in Pj then A succeeds in P^. 

Proof: It follows from Lemma WR because if an atom A satisfies M and succeeds in Pj, then A has 
a successful derivation in Pj which is consistent with M and quasi-decreasing w.r.t. >- Pj - Indeed, by 
Proposition 0J Pj satisfies M, and thus, all derivations starting from A are consistent with M. □ 

Lemma 14 If program P n is derived from program P n -i by an application of the definition elimination 
rule w.r.t. a non-basic predicate p, then for each atom A which has predicate p, if A succeeds in 
Po U Defs n then A succeeds in P n . 
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Proof: If A has predicate p then p depends on all clauses which are used for any derivation starting 
from A. Thus, every derivation from A using Po U Defs n is also a derivation using P n . □ 

Proposition 6 (Completeness) Let Pq, . . . , P n be a transformation sequence constructed by using 
the transformation rules and let p be a non-basic predicate in P n . Let M be a mode for Pq U Defs n 
such that: (i) Pq U Defs n is safe w.r.t. M, (ii) Pq U Defs n satisfies M, and (iii) the applications of the 
unfolding, folding, head generalization, and case split rules during the construction of Pq, . . . ,P n are 
all safe w.r.t. M. Suppose also that: 

1. if the folding rule is applied for the derivation of a clause G in program Pk+i from clauses 
Ci, . . . , C m in program P& using clauses D\, . . . , D m in Defsk, with 0<k<n, 

then for every i £ {1, . . . , m} there exists j G {1, . . . , n— 1} such that _Dj occurs in Pj and P/+i 
is derived from Pj by unfolding D{. 

2. during the transformation sequence PQ,...,P n the definition elimination rule either is never 
applied or it is applied w.r.t. predicate p once only, when deriving P n from P n _i- 

Then for each atom A which has predicate p and satisfies mode M, if A succeeds in Pq U Defs n then 
A succeeds in P n . 

Proof : Let us consider a transformation sequence Pq , . . . , P n constructed by using the transformation 
rules according to conditions Q and [2j 

As already mentioned, we can rearrange the sequence Pq, . . . ,P n into a new sequence Pq, . . . , Pq U 
Defs n , . . . , Pj , . . . , Pi, . . . , P n such that: (1) Po, . . . ,Pq U Defs n is constructed by applications of the 
definition introduction rule, (2) Po U Defs n , ■ ■ ■ ,Pj is constructed by unfolding every clause in Defs n , 
(3) Pj, . . . , Pi is constructed by applications of rules I2HH1 and (4) either (4.1) I = n or (4.2) I = n — 1 
and P n is derived from P n _i by an application of the definition elimination rule w.r.t. predicate p. 

Thus, Proposition follows from Lemmata I1U1 IT3*l and 1141 □ 

Appendix B. Proof of Proposition [2] 

For the proof of Proposition |2] we need the following two lemmata. 

Lemma 15 Let us consider a program P and a conjunction D of disequations. D succeeds in P iff 
every ground instance of D holds. 

Proof: Let us consider the conjunction (r± ^ s\, . . . , r^ ^ s^) of disequations. Every ground instance 
of (ri 7^ si, . . . , rfc ^ Sfc) holds iff for i = 1, . . . , k, and for every ground substitution a, r^a ^ Sjcr holds 
iff for i = 1, . . . , k, and for every ground substitution a, r^a is a ground term different from Sj<7 iff for 
i = 1, . . . , k, it does not exist a ground substitution a such that r^c and Sjcr are the same ground term 
iff for i = 1, . . . , k, ri and Si are not unifiable iff (ri ^s\, . . . , rk^Sk) succeeds in P. □ 

Lemma 16 Let P be a program which is safe w.r.t. mode M and satisfies mode M . Let the non-unit 
clauses of P be pairwise mutually exclusive w.r.t. mode M. Given any non-basic atom which 
satisfies M, and any basic goal Go, there exists at most one goal {A\, G\) such that A\ is a non-basic 
atom and (^OjGo) =^p (Ai,G\). 

Proof : By the definition of the p relation (see Section I2.4j) , we need to prove that for any non-basic 
atom Aq which satisfies M, and any basic goal Go, there exists at most one goal {A\, G\) where A\ is 
a non-basic atom, such that: (i) (Aq,Gq) i — >* P (A\,Gi), and (ii) the relation i — >* P is constructed by 
first applying exactly once Point (3) of our operational semantics, and then applying to the resulting 
goal Points (1) and (2) of our operational semantics, as many times as required to evaluate the leftmost 
basic atoms, if any. 

Since the non-unit clauses of P are pairwise mutually exclusive w.r.t. M, for any given non-basic 
atom Aq which satisfies M, there exists at most one non-unit clause, say G, of P such that Aq unifies 
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with hd(C) via an mgu, say fj,, and grd(C)fi succeeds in P. In fact, suppose to the contrary, that 
there were two such non-unit clauses, say C\ and G2. Suppose that, for j = 1, 2, clause Cj is renamed 
apart and it is of the form: 
Cj. p(tj,uj) <- grdj, Kj, 

where: (i) tj is a tuple of terms denoting the input arguments of p and (ii) the goal grdj is the guard 
of Cj, that is, a conjunction of disequations such that the leftmost atom of the goal Kj is not a 
disequation. 

Suppose that for j = 1, 2, hd(Cj) unifies with Aq via the mgu $j. Since Aq satisfies M, for j = 1, 2, 
the input variables of hd(Cj) are bound by $j to ground terms. Since t\ and t<i have a common ground 
instance, namely ii$i(= £2^2)1 they have a relevant mgu $ whose domain is a subset of vars{t\,t2), 
and there exists a ground substitution a with domain vars(t\,t2) such that ii^i = iii?<7(=i2'$2 = t2'&o~). 
Moreover, since the clauses C\ and C2 are renamed apart, we have that: 

(Property a) for j = l,2, if we restrict fta to varsitj) then r &j = r da. 
By hypothesis, both grdi~&\ and grd 2 '&2 succeed in P. Thus, by Lemma [T5l every ground instance of 
grd^&x and grd 2 & 2 holds. (Recall that the goals grdi$i and grd 2 '&2 are ground goals, except for the 
local variables of each disequation occurring in them.) 

Since P is safe w.r.t. M, for j = 1, 2, every variable occurring in a disequation of grdj either occurs 
in tj or it is a local variable of that disequation in Cj. Thus, by Property (a), grd^i = grd^a and 
grd 2 r d 2 = grd 2 da. Since every ground instance of grd^dx and grd 2 d 2 holds, we have that every ground 
instance of (grd^a, grd 2 'dcr) holds. In other words, there exists a ground substitution a whose domain 
is vars(t\,t2), such that every ground instance of (grd^, grd 2 )'&o~ holds. By definition, this means that 
(grd l , grd 2 )$ is satisfiable w.r.t. vars(t\,t2). This contradicts the fact that the non-unit clauses of P 
are mutually exclusive w.r.t. M . 

We conclude that for any given non-basic atom Aq which satisfies M, Aq unifies via an mgu, say 
/i, with the head of at most one non-unit clause, say C, of P such that grd(C)fi succeeds in P. 

Now there are two cases: (Case i) Aq unifies with the head of the clauses in {C, D\, . . . , D n }, where 
n>0, C is a non-unit clause, and clauses D\, . . . ,D n are all unit clauses, and (Case ii) Aq unifies with 
the head of the clauses in {D\, . . . , D n }, where n>0 and these clauses are all unit clauses. 

Let us consider Case (i). Let clause C be of the form: H <— K for some non-basic goal K. For any 
basic goal Go, by applying once Point (3) of our operational semantics, we have that: (Aq,Gq) i — >p 
{K,Gq)[i. Thus, (K,Gq)ij, is of the form (Bs,G2) where Bs is a conjunction of basic atoms and the 
leftmost atom of G2 is non-basic. Since for any basic atom B and goal G3, there exists at most one 
goal G\ such that {B, G3) 1 — >p G4, by using Points (1) and (2) of our operational semantics, we have 
that there exists at most one goal (Ai,G\) such that (Bs, G2) 1 — >p (Ai,G\), where the atom A\ is 
non-basic. 

Every other derivation starting from (Aq,Gq) by applying Point (3) of our operational semantics 
using a clause in {D\, ■ ■ ■ , D n }, is such that if for some goal G5 we have that (Aq, Gq) i — >* P G5, then 
G5 is a basic goal, because from a basic goal we cannot derive a non-basic one. This concludes the 
proof of the Lemma in Case (i). 

The proof in Case (ii) is analogous to that of the last part of Case (i). □ 

Now we give the proof of Proposition |2 

Proof: Take a non-basic atom A which satisfies M. Every non-basic atom Aq such that A 1 — >* P 
(Aq, Gq) for some goal Go, satisfies M because P satisfies M. Since P is linear, Go is a basic goal. By 
Lemma^Jthere exists at most one goal (A\, G\) where A\ is a non-basic atom, such that (^4o ; Go) =^p 
(A\,G\). Thus, there exists at most one non-unit clause G in P such that (^4o,Go) =^c (Ai,G\). 
This means that P is semideterministic w.r.t. M. □ 

Appendix C. Proof of Proposition [S] 

Proof: It is enough to show that the while-do statement in the Partition procedure terminates. To 
see this, let us first consider the set NonunitClsi n which is the value of the set NonunitCls at the 
beginning of the execution of the while-do statement. NonunitChi n can be partitioned into maximal 
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sets of clauses such that: (i) two clauses which belong to two distinct sets, are mutually exclusive, and 
(ii) if two clauses, say Co and C n+ \, belong to the same set, then there exists a sequence of clauses 
Co, C\ , . . . , C n _|_i , with n > 0, such that for i = 0, . . . , n, clauses Cj and C+i are not mutually exclusive. 

For our termination proof it is enough to show the termination of the Partition procedure when 
starting from exactly one maximal set, say K, of the partition of NonunitClsi n . This is the case 
because during the execution of the Partition procedure, the replacement of a clause, say C2, by the 
clauses, say C21 and C22, satisfies the following property: if clauses C2 and D are mutually exclusive 
then C21 and D are mutually exclusive and also C22 and D are mutually exclusive. 

Let every clause of K be renamed apart and written in a form, called equational form, where the 
input arguments are generalized to new variables and these new variables are bound by equations in 
the body. The equational form of a clause C will be denoted by C eq . For instance, given the clause C: 
p{f{X),r(Y,Y),r{X,U)) «- Body, with mode +, ?) for p, we have that C eq is: p{V,W,r(X)) «- 
V = f(X),W = r(Y,Y),Body. 

Let K eq be the set {C eq | C G K}. Thus, K eq has the following form: 

!p(vi,ui) <— Eqs 1 , Diseqsi, Bodr/i 
p(v n , u n ) <- Eqs n , Diseqs n , Body n 

where, for i = 0, . . . , n: (1) v i denotes a tuple of variables which are the input arguments of p, (2) u; L 
denotes a tuple of arguments of p which are not input arguments, (3) Eqs i denotes a conjunction 
of equations of the form X = t, which bind the variables in Vi, (4) Diseqs i denotes a conjunction 
of disequations, and (5) Body i denotes a conjunction of atoms which are different from disequations 
(recall that the clauses in NonunitClsi n are in normal form). Equations may occur also in Body i , but 
they do not bind any input variable of p(vi,Ui). 

Let us now introduce the following set T = {t \ t is a term or a subterm occurring in Eqst or 
Diseqs i for some i = 1, . . . , n). 

Every execution of the body of the while-do statement of the Partition procedure works by replacing 
a safe clause, say C2, by two new safe clauses, say C21 and C22- We will prove the termination of the 
Partition procedure by: (i) mapping the replacements it performs, onto the corresponding replacements 
of the clauses written in equational form in the set K eq , and (ii) showing that the set K eq cannot 
undergo an infinite number of such replacements. 

Let us then consider the equational forms C^, C|f, and C^f of the clauses C2, C21, and C22, 
respectively. We have that: (i) bd^C^f) has one more equation of the form X = r w.r.t. fo^C^ 9 ), and 
(ii) bd^C^) has one more disequation of the form X^r w.r.t. ^(C^ 9 ). We also have that there exists 
only a finite number of pairs (X,r), because X is a variable symbol occurring in K eq and r is a term 
occurring in the finite set T U {t \ t is a term or a subterm occurring in an mgu of a finite number of 
elements of T}. (We have considered mgu's of a finite number of elements of T, rather than mgu's of 
two elements only, because a finite number of clause heads in K may have the same common instance.) 

Thus, in order to conclude the proof, it remains to show that before the replacement of C2 by C21 
and C22, neither X = r nor X^r occurs in ta^Cl 9 ). Here and in the rest of the proof, the notion of 
occurrence of an equation or a disequation is modulo renaming of the local variables. Indeed, 

— in Case (1): (1.1) X^r does not occur in frc^C^ 9 ) because X/r is a binding of an mgu of the input 
arguments of hd{C\) and hd{C2), and clauses C\ and C2 are not mutually exclusive, and thus, X^r 
does not occur in bd{C2), and (1.2) X = r does not occur in M(C| 9 ) because X/r is, by construction, 
a binding of an mgu between the input arguments of the heads of the clauses C\ and C2 and these 
clauses are obtained as a result of the Simplify function which eliminates every occurrence of the 
variable X from C2, and 

— in Case (2): (2.1) X = r does not occur in bd{C^) because, by hypothesis, a variant of X^r occurs 
in bd(C\) and clauses C\ and C2 are not mutually exclusive, and (2.2) X^r does not occur in bd^C^) 
because X^r does not occur in bd(C2) (indeed, we choose X^r precisely to satisfy this condition). □ 
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